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ATTN: Technology Center 1600 - Driector Doll 

P.O. Box 1450 

Alexandria, VA 22313-1450 
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Service on May 10, 2004 in an envelope as "EXPRESS MAIL POST OFFICE TO ADDRESSEE" service under 37 C.F.R. § 1.10, Express 
Mail Label Number EL 992 784 638 US addressed to: Assistant Commissioner for Patents, ATTN: Technology Center 1600 - Director 
Doll, P.O. Box 1450, Alexandria, VA 22313-1450 

By: 




Jennifer B. Xistris 

REMARKS 

This is a Protest for application no. 09/845,416, filed 04/30/01. Third Party Protestors 
believe no fee is required but if the Commissioner deems otherwise he is authorized to charge 
Deposit Accout No. 08-1290. 

A copy of this Protest is also being forwarded on this day to counsel of record: David 
A. Einhorn, Anderson, Kill, & Olick, P.C., 1251 Avenue of the America, New York, NY 
10020 in an envelope as "Express Mail Post Office to Addressee" service under 37 C.F.R. 
1.10, Express Mail Label No. EL 658 779 192 US. 

DATE: May 10, 2004 By: ^ 
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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

In re Application of: Xiao 

Serial No.: 09/845,416 Group No.: 1634 

Filed: 04/30/01 Examiner: B. Whiteman 

Entitled: DNA Sequences Encoding Dystrophin Minigenes and Methods of 

Use Thereof 

Protest Under 37 CFR 1.291, and 
Request to Withdraw from Issue Under 37 CFR 1.313, 
Based on Recently Published 102(e) Reference 



Assistant Commissioner for Patents 

Attn: Technology Center 1600 - Director Doll 

P.O. Box 1450 

Alexandria, VA 22313-1450 



CERTIFICATE OF MAILING UNDER 37 C.F.R. §1.10 

I hereby certify that this correspondence (along with any referred to as being attached or enclosed) are being 
deposited with the U.S. Postal Service on May 7, 2004 in an envelope as "EXPRESS MAIL POST OFFICE TO 
ADDRESSEE" service under 37 C.F.R. 1.10, Express Mail Label No. EL 992784638 US addressed to Assistant 
Commissioner for Patents, P.O.Box 1450, Alexandria, VA, ATTN: John Doll and Brian Whiteman. 

Dated: May 10 n 7QOd r X .7^*^ ^ ^ 

// Jennifer B.vCistri 




Examiner Whiteman and Director Doll: 

The following communication is presented to protest pending Application 09/845,416 
(Xiao Application) under 37 CFR 1.291, and to request that this Application be withdrawn from 
issuance at the initiative of the Office under 37 CFR 1.313. This Application was allowed on 4- 
28-04 (currently located in the Certification Division). To the extent the claims in this 
Application encompass mini-dystrophin genes and peptides with 4 rod repeats (as found in the 
claims of the corresponding Xiao Patent Pub), these claims should be withdrawn from issuance 
and rejected in view of recently published Patent Pub. US2003/02 16332 to Chamberlain et al. 
("Chamberlain Patent Pub") which anticipates such claims under 35 USC 102(e). 
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I. 4 Rod Repeats 

This Protest and request for Patent Office initiated withdrawal from issuance of the 
allowed Xiao claims is relevant to claims encompassing mini-dystrophins with 4 rod repeats. 
The Xiao Patent Pub (US2003/0171312) claims will be used in this communication as a guide to 
discuss the applicability of the Chamberlain Patent Pub as an anticipating 102(e) reference for 
claims than encompass 4 rod repeats. This discussion will find use even if the allowed Xiao 
claims differ from the Xiao Patent Pub claims. 

Claim 1 of the Xiao Patent Pub includes 4 rod repeats. This claim is reproduced below: 

1 . An isolated nucleotide sequence encoding a dystrophin minigene comprising: 

(a) a N-terminal domain; 

(b) four to six rod repeats; 

(c) an HI domain of a dystrophin gene and an H4 domain of the dystrophin gene; and 

(d) a cysteine-rich domain, 

wherein the N-terminal domain is selected from a group consisting of a N-terminal domain of the 
dystrophin gene, a modified N-terminal domain of the dystrophin gene, and a N-terminal domain 
of a utrophin gene; the rod repeats are selected from a group consisting of rod repeats in the 
dystrophin gene, rod repeats in the utrophin gene, and rod repeats in a spectrin gene; the 
cysteine-rich domain is a cysteine-rich domain of the dystrophin gene or a cysteine-rich domain 
of the utrophin gene. 

II. Chamberlain Patent Pub Anticipates Xiao Claims Under 102(e) 

The Chamberlain Patent Pub anticipates any allowed Xiao claims that encompass 4 rod 
repeats under 102(e) as the Chamberlain Patent Pub discloses 4 rod repeat mini-dystrophins in an 
Application filed prior to the filing date of the Xiao priority filing date. As shown in the time- 
line below, even though the Xiao Application claims priority to a Provisional Application that 
predates the Chamberlain Provisional, the Xiao Provisional does not disclose 4 rod repeat mini- 
dystrophins. Indeed, it explicitly excludes them. The Chamberlain Provisional, however, 
discloses 4 rod repeat mini-dystrophins - thereby predating the Xiao disclosure of 4 rod repeat 
mini-dystrophins by almost 7 months. Consequently, the Chamberlain Patent Pub is a 102(e) 
reference with regard to any Xiao claims that encompass 4 rod repeat mini-dystrophins. 

A* Overview/Time-Line 

Presented below is a brief overview of time-line of the Xiao and Chamberlain Provisional 
and Regular patent applications that shows the Chamberlain Provisional was the first to disclose 
4 rod repeat mini-dystrophins. 
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1. April 28, 2000: Xiao Provisional Application No. 60/200,777 is filed (at Tab A), 
and discloses 5 and 6 rod repeat mini-dystrophins, but does not disclose 4 rod repeat mini- 
dystrophins. This Application explicitly excludes 4 rod repeats. 

2. October 6, 2000: Chamberlain Provisional Application No. 60/238,848 is filed 
(Tab B) 5 and discloses 4 rod repeat mini-dystrophins. 

3. April 30, 2001: Xiao Regular Application No. 09/845,416 is filed (Xiao Patent 
Pub is at Tab C), and disclosed 4 rod repeat mini-dystrophins for the first time (nearly 7 months 
after Chamberlain Provisional filing date). 

4. October 4, 2001 : Chamberlain Regular Application No. 10/149,736 is filed 
(Chamberlain Patent Pub is at Tab D) claiming priority to October 6, 2000 Provisional 
Application that discloses 4 rod repeat mini-dystrophins. Current status: active prosecution of 4 
rod repeat mini-dystrophin claims. 

5. November 20, 2003: Chamberlain Patent Pub is published. PAIR indicates that 
the last substantive Office Action in the Xiao prosecution was mailed June 30, 2003. Since the 
Chamberlain Patent Pub was not published until November of 2003, it appears that Examiner 
was not aware of this reference when the Xiao claims were allowed. 

B. Xiao Provisional Does Not Disclose 4 Rod Repeats Mini-Dystrophins 

The Xiao Provisional Application, filed April 28, 2000, does not disclose 4 rod repeat 
mini-dystrophins (See Xiao Provisional at Tab A). Four rod repeat mini-dystrophins were first 
introduced by Xiao in the Xiao Regular Application filed April 30, 2001. As such, claims 
encompassing 4 rod repeats are not entitled to the Xiao Provisional filing date, but instead, only 
receive the filing date of the Xiao Regular Application. 

The lack of teaching in this Provisional is highlighted by affirmative statements about the 
need to retain at least 5 rod repeats in the mini-dystrophins. For example, the Xiao Provisional 
states: 

"To ensure sufficient physical flexibility of the protein, all of our mini-dystrophins still 
retain at least five rod repeats (Rl, R2, R22, R23 & R24) and 2 hinges (HI and H4) in 
the central rod domain (Fig. 1)." (emphasis added, page 56). 

"However, the mini-dystrophin genes reported here accomodated at least 5 rod repeats 
(Rl, R2, R22, R23 & R24) and two hinges (HI and H4). Therefore we hypothesized that 
the length of the central rod domain is the most critical factor, based on the fact that a 
major role of dystrophin is to crosslink the myo fiber cytoskelton and plasma membrane 
and stabilize the structure during muscle contractions. If the dystrophin is too short to 
span the sliding distance between the cytoskeleton and plasma membrane during muscle 
contraction, the crosslink will be disrupted and the muscle membrane will become 
unstable and prone to mechanical damages." (emphasis added, page 60). 
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The lack of teaching of 4 rod repeats in the Xiao Provisional is also seen by comparing the Xiao 
Provisional to the Xiao Regular Application. For example, it is instructive to compare Figure 1 
in the Xiao Provisional (no 4 rod repeats are disclosed) to Figure 1 in the Xiao Regular 
Application (three 4 rod repeat constructs have been inserted into the middle of the figure). The 
description of Figure 1 in each application also highlights the change from at least 5 rod repeats 
in the Xiao Provisional (deleting 19 of the 24 rod repeats from natural dystrophin) to the 4 rod 
repeats in the Xiao Regular Application (deleting 20 of the rod repeats from natural dystrophin): 

Xiao Provisional page 56: "We have created by rational design several mini-genes, in 
each deleting up to 3/4 of the central rod domain (19 rods and 2 hinges) and nearly the 
entire distal C-terminal domain (exons 71-78) (Fig. 1)." 

Xiao Patent Pub Col. 5, par. 61 : "We have created minigenes in which up to 75% of the 
central rod domain (20 of the 24 rods; 2 of the 4 hinges), as well as nearly all of the C- 
terminal domain (exons 71-78), are deleted (FIG. 1)." 

In light of the lack of teaching of 4 rod repeats in the Xiao Provisional, it is clear that claims that 
encompass 4 rod repeats in the Xiao Regular Application are not entitled to the priority date of 
the Xiao Provisional Application. 

C. Xiao Claims Are Anticipated by the Chamberlain Patent Pub Under 102(e) 

As detailed above, Xiao claims that encompass 4 rod repeats are only entitled to the April 
30, 2001 filing date of the Xiao Regular Application, not the April 28, 2000 filing date of the 
Xiao Provisional Application. The Chamberlain Provisional, however, fully discloses 4 rod 
repeat mini-dystrophins and was filed October 6, 2000. As such, the Chamberlain Patent Pub is 
a 102(e) reference that pre-dates the Xiao Application for 4 rod repeats by almost 7 months. 

i. Chamberlain Provisional Fully Discloses 4 Rod Repeats 

Unlike the Xiao Provisional, the Chamberlain Provisional fully discloses 4 rod repeats. 
Discussion of 4 rod repeats is found throughout the 142 page Chamberlain Provisional. For 
example, 4 "spectrin-like repeats" (rod repeats) are taught at the following pages: page 3, lines 13 
and 28; page 4, lines 3 and 26; page 5, lines 2, and 4-7 (SEQ IDs 39-41); page 23, lines 3-4; 
Example 2 on pages 51-54 (detailing construction of AR4-R23, AR2-R21+H3, and AR2-R1; all 
of which have 4 spectrin-like repeats); Example 3 on pages 55-58 (detailing construction of AR4- 
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R23-71-78, which is a 4 spectrin like repeats construct with a C-terminal deletion); Example 5 on 
page 63-70 shows testing of various constructs including many 4 spectrin-like repeat constructs; 
Example 6 on pages 70-71 (describing how a 4 spectrin-like construct could be inserted into 
AAV and used to treat DMD or BMD); Figures 12-14 show the nucleic acid sequences for AR4- 
R23 (micro-dys 1; SEQ ID NO:39), AR2-R21 (micro-dys 2; SEQ ID NO:40), and AR2-R21-H3 
(micro-dys 3; SEQ ID NO:41); and Figure 27, which shows four 4 spectrin-like repeat constructs 
(the last 4 constructs in Figure 27). In light of the extensive disclosure of 4 spectrin-like (rod) 
repeats in the Chamberlain application, including Examples testing these constructs in vivo on 
dystrophic mouse models, it is clear that the Chamberlain Patent Pub is fully entitled to the 
priority date of the Chamberlain provisional for 4 rod repeat mini-dystropins. 

ii. Chamberlain Patent Pub Anticipates Exemplary Claim 1 

The Chamberlain Pat Pub, based on the disclosure of the Chamberlain Provisional, fully 
anticipates Claim 1 of the Xiao Pat Pub. Claim 1 of the Xiao Patent Pub recites a nucleic acid 
sequence encoding a mini-dystropin peptide with the following four elements: 

(a) a N-terminal domain; 

(b) four to six rod repeats; 

(c) an HI domain of a dystrophin gene and an H4 domain of the dystrophin gene; and 

(d) a cysteine-rich domain. 

The Chamberlain Provisional teaches nucleic acid sequences encoding mini-dystropins with the 
four recited elements. 

As described above, the Chamberlain Provisional describes constructs with 4 rod repeats. 
For example, the last four constructs in Figure 27 shows such 4 rod repeats. These constructs 
are labeled: R4-R23 (micro-dys 1), R2-R21 (micro-dys 2), R2-R21-H3 (micro-dys 3), and R4- 
R23-71-78. Each of these four constructs contain an "HI domain 11 and an "H4 domain" as 
recited in Claim 1. These four constructs also contain a "CR" (cystein-rich domain), as well as 
an N-terminal domain (labeled ABD1 for actin-binding domain, which is the N-terminal domain; 
see, e.g, SEQ ED NO:6). 

The four elements in Claim 1 are also taught in additional locations throughout the 
Chamberlain Provisional (i.e. not just in Figure 27). For example, the Chamberlain specification 
teaches the use of an "actin-binding domain" (aka N-terminal domain) throughout the summary 
of the invention and gives SEQ ID NO:6 as an example of a 756 nucleic acid sequence that 
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encodes an actin-binding domain (N-terminal domain). Figures 12-14 depicting SEQ ED 
NOs:39-41 also teach 4 rod repeat constructs with N-terminal domains (actin-binding domains). 

The Chamberlain Provisional also teaches the use of HI and H4 in mini-dystrophins. For 
example, page 6 lines 15-26; page 9, lines 18-29; and Figures 12-14 depicting SEQ ID NOs:39- 
41 (that teach 4 rod repeat constructs containing hinges 1 and 4). Cysteine-rich domains are also 
taught on page 5, lines 23-27; in SEQ ID NO:35, and in Figures 12-14 (teaching the nucleic acid 
sequence of 4 rod repeat constructs with cystein-rich domains). 

The last part of Claim 1 specifies that the various elements (e.g. N-terminals, rod repeats, 
and cystein-rich domains) can come from various sources such as dystrophin, modified 
dystrophin, utrophin, and spectrin. The Chamberlain Provisional not only teaches dystrophin, 
but also contains a section entitled "Variants and Homologs of Dystrophin" which teaches, for 
example, variants of dystrophin, and dystrophin homologs such as utrophin and alpha-actinin 
(see page 27-35). 

In light of the teaching in the Chamberlain Provisional, it is clear that the Chamberlain 
Patent Pub anticipates exemplary Xiao Claim 1 under 35 USC 102(e). As such, allowed Xiao 
claims similar to or identical to this claim (that include the 4 rod repeats) should be withdrawn 
from issue as unpatentable over the Chamberlain Pat Pub. 

iii. Chamberlain Patent Pub Anticipates All Exemplary Claims that 
Include 4 Repeats Under 35 USC 102(e) 

The remaining exemplary claims from the Xiao Patent Pub that include 4 rod repeats are 
also anticipated by the Chamberlain Patent Pub based on the disclosure in the Chamberlain 
Provisional. A claim by claim analysis is provided below, showing where support for each 
element is found in the Chamberlain Provisional. It is noted that the support provided in the 
Chamberlain Provisional is merely exemplary and not intended to be comprehensive (i.e. 
additional support for each element may also be found throughout the remainder of the 
Provisional Application): 

Claim 2: requires that the mini-dystrophin further comprises at least 3 amino acids of a C- 
terminal domain of the dystrophin gene. Chamberlain Provisional - Figure 27 teaches the use 
of "CT" (C-terminal domain) as part of the 4 rod repeat constructs. C-terminal domains are also 
taught on page 8, lines 8-18; and SEQ ID NO:36. 
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Claim 3: essentially identical to Claim 1, but requires the nucleic acid sequence to be less than 
5,000 nucleotides in length. Chamberlain Provisional - Page 4 teaches that the nucleic acid "is 
less than 5 kilo-bases in length" and Example 6 describes a four rod repeat construct, deleted for 
exons 71-78, with a total size of 4.7kb. 

Claim 4: requires that the mini-dystrophin further comprise an H2 or H3. Chamberlain 
Provisional - Figure 27 teaches R2-R21-H3-Micro-Dys 3 that contains HI, H4 and further 
contains H3. 

Claim 5: requires 4 rod repeats specifically. Chamberlain Provisional - See discussion above 
of extensive teaching of 4 rod repeats in Chamberlain Provisional. 

Claim 11 : nucleic acid sequence consists of SEQ ID NO: 10, which is construct A353 1 
containing 4 rod repeats (Rl, R22, R23, and R24; see Figure 1 of Xiao Pat Pub). Chamberlain 
Provisional - teaches construct R2-R21-Micro-Dys 2 that contains the exact 4 repeats (i.e Rl, 
R22, R23, and R24), as well as the same hinges (HI and H4), and the use of an N-terminal, 
cystein-rich, and C-terminal domains. The sequence for Micro-Dys 2 is provided in Figure 13, 
which is SEQ ID NO:40. 

Claim 12: nucleic acid sequence consists of SEQ ID NO: 12, which is construct A35 1 0 
containing 4 rod repeats (Rl, R2, R23, and R24; see Figure 1 of Xiao Pat Pub). Chamberlain 
Provisional - teaches construct R2-R21 -Micro-Dys 2 that contains the 3 of the four repeats in 
SEQ ID NO: 12 (Micro-Dys 2 uses R22 instead of R2) Micro-Dys 2 uses the same hinges (HI 
and H4), and the use of an N-terminal, cystein-rich, and C-terminal domains. The sequence for 
Micro-Dys 2 is provided in Figure 13, which is SEQ ID NO:40. 

Claim 13: nucleic acid sequence consists of SEQ ID NO: 14, which is construct A3447 
containing 4 rod repeats (Rl, R2, R3, and R24; see Figure 1 of Xiao Pat Pub). Chamberlain 
Provisional - teaches construct R4-R23-Micro-Dys 1 that contains the exact 4 repeats (i.e. Rl, 
R2, R3, and R24), as well as the same hinges (HI and H4), and the use of an N-terminal, cystein- 
rich, and C-terminal domains. The sequence for Micro-Dys 1 is provided in Figure 11, which is 
SEQ ID NO:39. 

Claim 14: requires the nucleic acid sequence of Claim 1 linked to expression control elements 
in an AAV. Chamberlain Provisional - teaches the use of AAV vectors at pages 40-41 and 
Example 6, as well as the use of expression control elements at pages 7, lines 2-6, and pages 36- 
38, and Figures 36-37. 

Claim 15: limits claim 14 by requiring MCK or CMV promoters. Chamberlain Provisional - 

teaches the use of MCK promoters at pages 7, lines 2-6, and pages 36-38, and Figures 36-37. 

Claim 18: Method of using nucleic acid of Claim 1 attached to an expression control element in 
order to treat Duchenne muscular dystrophy (DMD) or Becker muscular dystrophy (BMD). 
Chamberlain Provisional - Treating a subject with DMD or BMD is taught on page 8, lines 3-7 
and Example 6. 
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Claim 19: Limits Claim 18 by requiring the use of AAV. Chamberlain Provisional - Example 
6 which describes the use of a 4-repeat construct with AAV to treat DMD or BMD. 

Claim 20: Limits Claim 18 by requiring the use of a retrovirus. Chamberlain Provisional - 

teaches the use of retroviruses on pages 41-42 under the heading "6. Retroviruses/ 1 

Claim 21: Essentially repeats Claim 18, but requires that the nucleic acid sequence be less than 
5,000 bases. Chamberlain Provisional - Again, Example 6 describes the treatment of DMD or 
BMD with a construct that is 4.7 kb. 

Claim 22: Further limits Claim 21 by requiring the use of AAV. Chamberlain Provisional - 
Example 6 uses AAV. 

Claim 23: Further limits Claim 21 by requiring the use of a retrovirus. Chamberlain 
Provisional - teaches the use of retroviruses on pages 41-42 under the heading "6. Retroviruses." 

Claim 26: Dependent on Claim 11, further requiring an expression control element. 
Chamberlain Provisional - see support for Claim 1 1 and discussion of support for control 
elements above. 

Claim 27: Dependent on Claim 12, further requiring an expression control element. 
Chamberlain Provisional - see support for Claim 12 and discussion of support for control 
elements above. 

Claim 28: Dependent on Claim 13, further requiring an expression control element. 
Chamberlain Provisional - see support for Claim 13 and discussion of support for control 
elements above. 

In light of the teaching in the Chamberlain Provisional described above, it is clear that the 
Chamberlain Patent Pub anticipates all of the exemplary Xiao Claims that include a 4 repeat 
mini-dystrophin under 35 USC 102(e). As such, allowed Xiao claims similar to or identical to 
these claim (that include the 4 rod repeats) should be withdrawn from issue as unpatentable over 
the Chamberlain Pat Pub. 



III. Withdrawal from Issuance Under 37 CFR 1.313 

To the extent any of the Xiao claims that have been allowed include 4 repeat mini- 
dystrophins, these claims must be withdrawn from allowance. The Patent Office regulations 
specifically contemplate that an application may withdrawn after it has been allowed. In 
particular, 37 CFR 1 .313, entitled "Withdrawal from issue" indicates that an Application "may be 
withdrawn from issue for further action at the initiative of the Office or upon petition by the 
Applicant." The wording of section 1 .3 13(a) makes it clear that the Examiner can withdraw the 
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Application for any reason. The wording of section 1.313(b) makes it clear that the Examiner 

can still withdraw the Application after the issue fee has been paid for a number of reasons, 

including: (1) Unpatentability of one or more claims; or (2) Interference. 

While it is unknown by Protestors whether the issue fee has been paid (which is unlikely 

given that the Notice of Allowance was only mailed about 12 days ago), it is not relevant because 

this case should be withdrawn based on the unpatentability of any claims that encompass 4 repeat 

mini-dystrophins in light of the Chamberlain Pat Pub. It is also noted that, under MPEP 1308, 

even if the issue fee has been paid, an allowed Application: 

"may be removed from the Office of Patent Publication, without it being withdrawn from 
issue under 37 CFR 1.313(b) to permit the Examiner to consider an information 
disclosure statement or whether one or more claims are unpatentable." 

Therefore, at a minimum, Protestors urge the Examiner to request that the file be returned to the 
Examiner to consider the patentability of the claims in light of the Chamberlain Pat Pub, in the 
interest of not issuing an invalid patent. Furthermore, Protestors submit that any claims that 
encompass 4 rod repeats must cause the application to be withdrawn from issuance such that the 
claims can be rejected in light of the Chamberlain Patent Pub. 

Finally, as noted above, a second reason an Application may be withdrawn from issue 
(even if the issue fee has been paid) is for an Interference. Protestors are currently prosecuting 
U.S. Application Ser. No. 10/149,736, which published as the Chamberlain Pat Pub. This 
Application is currently in active prosecution, with claims directed specifically to nucleic acid 
sequences encoding mini-dystrophins with 4 rod repeats. Prior to declaring an Interference, since 
the Chamberlain Application would be the senior party with respect to 4 repeat constructs, 
Protestors submit that the Examiner should reject any Xiao claims that include 4 repeat 
constructs. Only IF Xiao is able to demonstrate invention prior to the Chamberlain Provisional 
filing date would an Interference be possible. 

Conclusion 

To the extent any of the allowed Xiao claims encompass 4 rod repeat mini-dystrophins, 
these claims must be withdrawn from issuance and rejected in light of the Chamberlain Patent 
Pub. Protestors submit that it would be unfair to the public to allow any 4 repeat encompassing 
Xiao claims to issue as these claims would be invalid in light of the Chamberlain Pat Pub. 
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Applicants note for the Examiner's convenience that a copy of this communication will be sent to 
Xiao's representative at the correspondence address listed on the cover of the Xiao Pat Pub, as 
well as to the Examiner handling the pending Chamberlain Application (with the next 
communication in that case). 



Dated: May. 1 0 J 7004 ' — 





ison R. Bond 
Registration No. 45,439 
Medlen & Carroll, llp 

440 Science Drive, Suite 203 
Madison, Wisconsin 53711 
608-218-6900 
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Abstract 



The present invention provides a series of novel mini-dystrophin genes that retain the essential 

biological functions. The expression of the mini-dystrophin genes are under the control of a 

muscle-specific promoter or a non-muscle-specific promoter along with a small polyadenylation 

signal. The entire gene expression cassettes can be readily packaged into adeno-associated virus 

(AAV) vectors. Moreover, the present invention provides a method of treatment for Duchenne 

muscular dystrophy (DMD) and Becker muscular dystrophy BMD). using the novel AAV 

« vectors containing the mini-genes. These mini-dystrophin genes have been demonstrated by the 

0 inventor in a DMD mouse model mdx to be able to alleviate muscular dystrophic pathology and 
TU 

P to result in normal myofiber morphology, histology and cell membrane integrity. Finally, the 

^ present invention further defines the minimal functional domains of dystrophin and provides 

^ ways to optimize and create new versions of mini-dystrophin genes. 
■ 

© 

© 
© 
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TECHNICAL FIELD 



The present invention provides a series of novel mini-dystrophin genes that retain the 
essential biological, functions. The expression of the mini-dystrophin genes are under the control 
of a muscle-specific promoter or a non-muscle-specific promter along with a small 
polyadenylation signal. The entire gene expression cassenes can be readily packaged into adeno- 
associated virus (AAV) vectors. Moreover, the present invention provides an method of 
treatment for Duchenne muscular dystrophy (DMD) and Becker muscular dystrophy BMD) 
using the novel AAV vectors containing the mini-genes, which are demonstrated by the inventor 
in a DMD mouse model max to alleviate muscular dystrophic pathology and to result in normal 
myofiber morphology, histology and cell membrane integrity. Finally, the present invention 
further defines the minimal functional domains of dystrophin and provides ways to optimize and 
create new versions of mini-dystrophin genes. 



BACKGROUND OF INVENTION 

Duchenne muscular dystrophy (DMD) is the most common and lethal genetic muscle 
disorder, caused by recessive mutations in dystrophin gene. [Koenig, M. et.ai. Cell 50, 509-5 1 7 
(1987)]. One of every 3500 males suffers from DMD, yet no treatment is available. Genetic 
therapeutic approaches using primarily myoblast transplantation or adenovirus-mediated gene 
transfer, have met with little success. [Partridge, T. A. et al., Nature 337, 176-179 (1989); 
Acsadi, G. et al. Nature 352, 815-818 (1991); Ragot, T. et al Nature 361, 647-650 (1993); and 
Gussoni, E., et al., Nature Med. 3, 970-977 (1997)]. Adeno-associated virus (AAV) vectors, 
although proven superior for muscle gene transfer, [Xiao, X. et al., Journal of Virology 70, 8098- 
8108 (1996)] are too small (5 kb) to package the dystrophin gene (14 kb cDNA). 



Gene delivery is an important method for the treatment of acquired and inherited 
diseases. A number of viral based systems are being developed for gene transfer purposes. In 
particular, retroviruses are currently the most widely used viral vector system for gene delivery. 
Although retroviral systems are popular, they suffer from several drawbacks. Especially, 
retroviral particles are relatively labile and hence unstable. Therefore, purification of 

* * 

recombinant viruses can lead to significant loss in titer. Moreover, retroviruses have a limited 
host range and cannot integrate into nonreplicating cells. Accordingly, cells do not normally 
divide, such as mature muscle myofibers and neurons, or cells which replicate slowly, cannot be 
genetically altered using retroviral vectors unless stimulated to divide before infection. 
Furthermore, retroviruses are known to cause disease in certain animals, including humans, and 
thus pose a significant health risk to the subject transfected with a recombinant virus. 

Adenovirus based systems have been developed for gene delivery in an attempt to 
overcome these problems. Human adenoviruses are double-stranded DNA viruses which enter 
cells by receptor-mediated endocytosis. These viruses are particularly well suited for gene 
transfer because they are easy to grow and manipulate and they exhibit a broad host range in 
vivo and in vitro. Despite these advantages, adenovirus vectors suffer from several drawbacks. 
For example, adenovirus vectors express proteins transiently because the transferred gene does 
not integrate into the chromosome of the target cell. Hence, as the cells divide, the transferred 
gene is lost. In this regard, such vectors are ineffective for long term gene therapy. Furthermore, 
adenovirus vectors express viral proteins that may elicit an immune response which may 
decrease the life of the transduced cell. This immune response may preclude subsequent 
treatments because humoral and/or T cell responses. Finally, Ad vectors can not efficiendy infect 
mature muscle due to the inability to bypass the barrier of extracellular matrix. 



Adeno-associatee virus (AAV), the only non-pathogenic viral vector currently available, 
has been successfully used to establish efficient and long-term gene expression in both dividing 
and non-dividing cells in vivo without significant immune response or toxicity [Samulski, R. J. et 
al Development of Human Gene Therapy 131-172 ,Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, (1999)]. Unlike other viral and non-viral vectors, AAV readily bypasses 
extracellular barriers in muscle due to its small particle size (20 run), and it successfully 
transduces myofibers of various maturity. Currently, AAV vectors offer the best gene transfer 
efficiency and longevity among all viral and non-viral vectors tested for gene therapy in muscle 
tissues. This unparalleled efficiency and safety of the vector system have led to an increasing 
interest in AAV-mediated gene therapy for genetic muscle disorders as well as for metabolic 
diseases involving genes of smaller size (< 4.5 kb). [Kessler, P. D. et al Proceedings of the 
m National Academy of Sciences of the United States of America 93, 14082-14087 (1996); Song, 
jg S. etal Proc Natl Acad Sci USA 95, 14384-14388 (1998);Herzog, R. W. etal Nat- Med S f 56- 
0 63 (1999); and Xiao, X. et al J. Virol 74, 1436-1442 (2000)]. 

\i 

S\ Previous attempts to generate mini-genes that were shorter than 1/2 of the full length 

i 

Q dystrophin failed to preserve the essential protective functions. [Yuasa, K. et al, FEBS Lett 425, 

jjj 329-336 (1998)]. Although the mini-genes contained both intact N- and C-terminal domains and 

P 1 to 3 central rod repeats, they were functionally similar to a C-terminal dvstrophin construct 
0 

(Dp71 ), [Cox, O. A. et al., Nat Genet & % 333-339 (1994); Greenberg, D. S. et al„ Nat Genet 8, 
340-344 (1994)], and thus sufficient to restore DAP complexes but insufficient to protect muscle 
from dystrophic pathology. However, the mini -dystrophin genes reported in the present 
invention accommodated at least 5 rod repeats (Rl, R2, R22, R23 and R24) and two hinges (HI 
and H4). Therefore we hypothesized that the length of the central rod domain is the most critical 
factor, based on the fact that a primary role of dystrophin is to crosslink the myofiber cytoskelton 
and plasma membrane and stabilize the structure during muscle contraction. If the dystrophin is 
too short to span the sliding distance between the cytoskeleton and plasma membrane during 
muscle contraction, the crosslink will be disrupted and the muscle membrane will become 



I 

I 

unstable and prone to mechanical damages. To accommodate as many rod units in the central 
domain without exceeding the AAV vector packaging limit, we have for the first time deleted the 
entire C-terminus (819 bp) without sacrificing the primary functions of dystrophin. Our results 
indicate that 5 rods and 2 hinges provide sufficient length and flexibility for the central domain. 

The most importance is that AAV is the best vector system currently available for 
muscle-based gene therapy. However, until this report AAV's utility has been precluded for 
DMD, the most common and lethal muscle disorders. Previously other viral and nonviral vectors, 
[Acsadi, G. et al Nature 352, 815-818 (1991); Ragot, T. et al Nature 361, 647-650' (1993)] as 
well as myoblast transplantation [Partridge, T. A. et al., Nature 337, 176-179 (1989); Gussoni, E. 
et al, Narure Med 3, 970-977 (1997)] have been explored for DMD with limited success. 
Recent studies using stem-cell transplantation have offered a new hope for cell therapeutics of 
DMD. [Gussoni, E. et al Nature 401, 390-394 (1999)]. The novel functional dystrophin genes 
reported here should also find their utilities in the stem-cell therapy after ex vivo gene transfer. 
Nevertheless, the primary advantage of AAV vector is its direct in vivo gene delivery such as 
intramuscular injections, or in vivo vector delivery through blood circulation [Greelish, J. P. et 
al Nat Med 5, 439-443 (1999)] . Finally, using the AAV vector rather than the traditional 
transgenic mouse technology, we have provided a more convenient and less time-consuming 
method to further discern the dystrophin functional domains in vivo and to optimize the mini- 
genes for DMD gene therapy. 




5 



SUMMARY OF THE INVENTION 

The present invention provides the dystrophin gene which can be successfully reduced to 
approximately one third (1/3) of its 1 1 kb'full-Iength coding sequence, without compromising 
essential functions in protecting muscles from dystrophic phenotypes. Moreover, the present 
invention provides AAV vectors carrying the mini -genes and capable of mediating efficient and 
stable correction of both biochemical and physiological defects in a major muscle group of a 
DMD animal model. Furthermore, the expression of the mini-dystrophin genes are controlled 
either by a muscle-specific promoter, or by a non-muscle-specific promoter along with a small 
polyadenylation signal. New development in systemic delivery of AAV vectors through the 
blood circulation should enable more widespread gene transfer in large groups of muscle for 
DMD gene therapy. Furthermore, the present invention provides a method that is more 
convenient and' less time-consuming to discern the dystrophin functional domains in vivo and to 

• * 

• optimize the mini-genes for DMD gene therapy. 



BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1. Construction of highly truncated mini-dystrophin genes. 



Dystrophin has four major domains: the N-terminal domain (N), the cysteine-rich domain 
(CR), the C-terminal domain (CT) and the central rod domain, which contains 24 rod repeats (R) 
and 4 hinges (H). The mini-dystrophin genes were constructed by deleting a large portion of the 
central rods and hinges and nearly the entire CT domain (except the last 5 amino acids). The 
mini-dystrophin genes were subsequently cloned between an MCK (muscle-specific creatine 
kinase) promoter, or a CMV promoter, along with a small polyA sequence in AAV vectors. 

D 

fU Figure 2. Immunofluorescent (IF) analysis of the dystrophin and dystrophin-associated protein 

jjfj complexes in gastrocnemius muscle. 

SI 

V{ a, Cryosections of mdx muscle at 3 -months after treatment with construct MCK- 

3849 (A) or MCK-3990 (B) were IF stained with an antibody against 
dystrophin (Dys3, green color) and counter-stained for cell nuclei with DAPI 



(JJ (blue color). Photos were taken with a 4X microscope lens. Note that 

O 

P widespread mini-gene expression and healthy peripheral nucleation are evident. 

b. b. Cryosections of muscles from 15-week old normal C57/B10 mice, or mdx 

mice treated either with vector MCK-3849, MCK-3990 or MCK-4173, or 
untreated mdx mice, were IF stained with antibodies against dystrophin (1 st 
row), or against a-sarcoglycan (2 nd row), P-sarcoglycan (3 rd row) and y- 
sarcoglycan (4 row). Cryosections stained with anti-dystrophin antibody (1 
row) were also counter-stained with DAPI (blue color) for cell nuclei. Photos 
were taken with a 20X microscope lens. 



Figure 3. In vivo myoflber plasma membrane integrity test. 

At 15 hours after intravenous injection of Evans Blue dye, the gastrocnemius muscle 
either from 1 5-week old.normal C57/B10 mice, from mdx mice treated with AAV vectors 
containing mini-genes A3849, A3990 or A4173, or from the untreated mdx mice were collected 
and cryosectioned. Dystrophin or mini-dystrophin expression was visualized by 
immunofluorescent staining (l* 1 column, green color). The leaky myofibers were visualized by 
the uptake of Evans Blue dye showing red fluorescence (2 nd column). Note the mutual 
exclusivity between dystrophin expression and Evans Blue dye uptake, when the two images 
were superimposed (3 ri column). Photos were taken with a 10X microscope lens. 



DNA Sequence ID No 1 to No 9: 



Seq, ID No 1: Mini-dystrophin A4173 It is a mihi-dystrbphin construct. 



AttttcaCCatgGtttggtgggaagaagtagaggactgttatgaaagagaagatgttcaaaagaaaacattcacaaaatg 



ggtaaatgcacaattttctaagtttgggaagcagcatattgagaacctcttcagtgacctacaggatgggaggcgcctcc 



tagacctcctcgaaggcctgacagggcaaaaactgccaaaagaaaaaggatccacaagagttcatgccctgaacaatgte 

P 

fU 

m aacaaggcactgcgggtlttgcagaacaataatgttgatttagtgaatattggaagtactgacatcgtagatggaaatca 

VI 
M 



taaactgactcttggUtgatttggaataUatcxtccactggcaggtcaaaaatgtaatgaaaaatatcatggctggat 



0 

jjj tgcaacaaaccaacagtgaaaagattctcctgagctgggtccgacaatcaactcgtaatiatccacaggttaatgtaatc 
0 aacttcaccaccagctggtctgatggcctggctttgaatgctctcatccatagtcataggccagacctatttgactggaa 



tagtgtggtttgccagcagtcagccacacaacgactggaacatgcattcaacatcgccagatatcaattaggcatagaga 



aactactcgatcctgaagatgttgataccacctatccagataagaagtccatcttaatgtacatcacatcactcttccaa 



gttttgcctcaacaagtgagcattgaagccatccaggaagtggaaatgttgccaaggccacctaaagtgactaaagaaga 



acattttcagttacatcatcaaatgcaciattctcaacagatcacgglcagtctagcacagggatatgagagaacttctt 



cccctaagcctcgattcaagagctatgcctacacacaggctgcttatgtcaccacctctgaccctacacggagcccattt 



ccttcacagcatttggaagctcctgaagacaagtcatttggcagttcattgatggagagtgaagtaaac^tggaccgtta 



tcaaacagctttagaagaagtattatcgtggcttctttctgctgaggacacattgcaagcacaaggagagatttccaatg 



atgtggaagtggtgaaagaccagtttcatactcatgaggggtacatgatggatttgacagcccatcagggccgggttggt 



aatattctacaattgggaagtaagctgattggaacaggaaaattatcagaagatgaagaaactgaagtacaagagcagat 

0 

fjj gaatctcctaaattcaagatgggaatgcctcagggtagctagcatggaaaaacaaagcaatttacatagagttttaacgg 

2 
S 

^ atctccagaatcagaaactgaaagagttgaatgactggctaacaaaaacagaagaaagaacaaggaaaatggaggaagag 

» 

0 



cctcttggacctgatcttgaagacctaaaacgccaagtacaacaacataaggtgcttcaagaagatctagaacaagaaca 



agtcagggtcaattctctcactcacatggtggtggtagttgatgaatctagtggagatcacgcaactgctgctttggaag 



aacaacttaaggtattgggagatcgatgggcaaacatctgtagatggacagaagaccgctgggttcttttacaasacate 



cttctcaaatggcaacgtcttactgaagaacagtgcctttttagtgcatggctttcagaaaaagaagatgcagtgaacaa 



gattcacacaactggcutaaagatcaaaatgaaatgttatcaagtcttcaaaaactggccgitttaaaagcggatctag 



aaaagaaaaagcaaiccatgggcaaactgtattcactcaaacaagatcttctttcaacactgaagaataagtcagigacc 



1A 



0 

i 

□ 



cagaagacggaagcatggctggataactttgcccggtgttgggataatttagtccaaaaacttgaaaagagtacagcaca 



gactcatagattactgcaacagttccccctggacctggaaaagtt^^ 



atgtcctacaggatgctacccgtaaggaaaggctcctagaagactccaagggagtaaaagagctgatgaaacaatggcaa 



gacctccaaggtgaaattgaagctcacacagatgtttatcacaacctggatgaaaacagccaaaaaatcctgagatccct 



ggaaggttccgatgatgcagtcctgttacaaagacgtttggataacatgaacttcaagtggagtgaacttcggaaaaagt 

2 

ctctcaacattaggtcccatttggaagccagttctgaccagtggaagcgtctgcacctttctctgcaggaacUctggtg 

(9 

tggclacagctgaaagatgatgaattaagccggcaggcacctattggaggcgactttccagcagttcagaagcagaacga 

SI 



tgtacatagggccttcaagagggaattgaaaactaaagaacctgtaatcatgagtactcttgagactgtacgaatatttc 



tgacagagcagcctttggaaggactagagaaactctaccaggagcccagagagctgcctcctgaggagagagcccagaat 



gtcactcggcttctacgaaagcaggctgaggaggtcaatactgagtgggaaaaattgaacctgcactccgctgactggca 



gagaaaaatagatgagacccttgaaagactccaggaacttcaagaggccacggatgagctggacctcaagctgcgccaag 



ctgaggtgatcaagggatcctggcagcccgtgggcgatctcctcattgactctctccaagatcacctcgagaaagtcaag 



gcacttcgaggagaaattgcgcctctgaaagagaacgtgagccacgtcaatgaccttgctcgccagcttaccactttggg 



11 



cattcagctctcaccgtataacctcagcactctggaagacctgaacaccagatggaagcttctgcaggtggccgtcgagg 
accgagtoaggcagctgcatgaagcccacagggac^ 

ccctgggagagagtxatctcgccaaacaaagtgccctactatatcaaccacgagactcaaacaacttgctgggaccatcc 

caaaatgacagagctctaccagtctttagctgacctgaataatgtcagattctcagcttataggactgccatgaaactcc 

gaagactgcagaaggccctttgcttggatctcttgagcctgtcagctgcatgtgatgccttggaccagcacaacctcaag 

caaaatgaccagcccatggatatcctgcagattattaattgtttgaccactatttatgaccgcctggagcaagagcacaa 
caamggtcaacgtecstctctgcgtggaut^ 

ggatccgtgtcctgtcttttaaaactggcatcatttccctgtgtaaagcacatttggaagacaagmcagatac^ 

aagcaagtggcaagttcaacaggatmgtgaccagcgcaggctgggcctccttctgcatgattctatccaaattccaag 

acagttgggtgaagttgcatccmgggggcagtaacattgagccaagtgtccggagctgcttccaatttgctaata 

agccagagatcgaagcggccctcttcctagactggatgagactggaaccccagtccatggtgtggctgcccgtcct 

agagtggctgctgcagaaactgccaagcatcaggccaaatglaacatctgcaaagagtgtccaatcattgga 

caggagtclaaagcactttaattatgacatctgccaaagctgcttrattctggtcgagttgcaaaag^ 



actatcccatggtggaatattgcactccgactacatcaggagaagatgttcgagactttgccaaggtactaaaaaacaaa 



tttcgaaccaaaaggtattttgcgaagcatccccgaatgggctacctgccagtgcagactgtcttagagggggacaacat 



ggaaactcccgacacaatgtag 



Seq* ID No 2; Mini-dystrophin A3990, It is a mini-dystrophin construct. 



AttttcaCCatgGtttggtgggaagaagtagaggactgttatgaaagagaagatgttcaaaagaaaacattcacaaaatg 



ggtaaatgcacaattttctaagtttgggaagcagcatattgagaacctcttcagtgacctacaggatgggaggcgcctcc 



tagacctcctcgaaggcctgacagggcaaaaactgccaaaagaaaaaggatccacaagagttcatgccctgaacaatgtc 



aacaaggcactgcgggttttgcagaacaataatgttgatttagtgaatattggaagtactgacatcgtagatggaaatca 



taaactgactcttggtttgatttggaatataatcctccactggcaggtcaaaaatgtaatgaaaaatatcatggctggat 



tgcaacaaaccaacagtgaaaagaactcctgagctgggtccgacaatcaactcgtaattatccacaggttaatgtaatc 



aacttcaccaccagctggtctgatggcctggctttgaatgctctcatccatagtcataggccagacctatttgactggaa 



tagtgtggutgccagcagtcagccacacaacgactggaacatgcattcaacatcgccagatatcaattaggcatagaga 



aactactcgatcctgaagatgttgataccacctatccagataagaagtccatcltaMgiacatcacatcactcttccaa 



gttttgcctcaacaagtgagcattgaagccatccaggaagtggaaatgttgccaaggccacctaaagtgactaaagaaga 



acattttcagttacatcatcaaatgcactattctcaacagatcacggtcagtctagcacagggatatgagagaacctctt 



cccctaagcctcgaltcaagagctatgcctacacacaggctgcttatgtcaccacctctgacxctacacggagcccattt 



ccttcacagcatttggaagctcctgaagacaagtcatttggcagttcattgatggagagtgaagtaaacctggaccgtta 



Ccaaacagctttagaagaagtattatcgtggcttctttctgctgaggacacattgcaagcacaaggagagatttctaatg 



atgtggaagtggtgaaagaccagtttcatactcatgaggggtacatgatggautgacagcccatcagggccgggttggt 



aatattctacaattgggaagcaagctgauggaacaggaaaattatcagaagatgaagaaactgaagtacaagagcagat 



M 

0 gaatctcctaaattcaagatgggaatgcctcagggtagctagcatggaaaaacaaagcaatttacatagagttttaatgg 



55 

Q atctrcagaatcagaaactgaaagagttgaatgactggctaacaaaaacagaagaaagaaeaaggaaaatggaggaagag 

0 



cctcttggacctgatcttgaagacctaaaacgccaagtacaacaacataaggtgcttcaagaagatctagaacaagaaca 



agtcagggtcaattctctcactcacatggtggtggtagttgatgaatctagtggagatcacgcaactgctgctttggaag 



aacaacttaaggtattgggagatcgatgggcaaacatctgtagatggacagaagaccgctgggttcttttacaagaccag 



cctgatxtagctcctggactgacc^ctauggagc^tctcctactcagactgttacictggtgacacaacctgtggttac 



taaggaaactgccatctccaaactagaaatgccatet^ 



tccccctggacctggaaaagtttcttgcctggcttacagaagctgaaacaactgccaatgtcctacaggatgciacccgt 



aaggaaaggctcctagaagactccaagggagtaaaagagctgatgaaacaatggcaagacctccaagglgaaattgaagc 



tcacacagatgtttatcacaacctggatgaaaacagccaaaaaatcctgagatccctggaaggttccgatgatgcagtcc 



IP 



tgttacaaagacgtttggataacatgaacttcaagtggagtgaacttcggaaaaagtctctcaacattaggtcccatng 



gaagccagttctgaccagtggaagcgtctgcacctttctctgcaggaacttctggtgtggctacagctgaaagatgatga 



Si attaagccggcaggcacctattggaggcgactttccagcagttcagaagcagaacgatgtacatagggccttcaagaggg 

P aattgaaaacLaaagaacctgtaateatgagtactcngagactgtacgaatatttctgacagagcagcctttggaagga 

tit 
09 

(jj ctagagaaactctaccaggagcccagagagctgccCcctgaggagagagcccagaatgtcactcggcttctacgaaagca 



ggctgaggaggtcaatactgagtgggaaaaattgaacctgcactccgctgactggcagagaaaaatagatgagacccttg 



aaagactccaggaacttcaagaggccacggatgagctggacctcaagctgcgccaagctgaggtgatcaagggatcctgg 



cagcccgtgggcgatctcctcangactctc^ccaagatcacxtcgagaaagtcaaggcacttcgaggagaaattgcgcc 



tctgaaagagaacgtgagccacgtcaatgaccttgctcgccagcttaccactttgggcattcagctctcaccgtataacc 



15 



tcagcactctggaagacctgaacaccagatggaagcttctgcaggtggccgtcgaggaccgagtcaggcagctgcatgaa 
gcccacagggactttggtccagcatctcagcac^ 



aaacaaagtgccctactatatcaaccacgagactcaaacaacttgctgggaccatcccaaaatgacagagctctaccagt 



ctttagctgacctgaataatgtcagattctcagcttataggactgccatgaaactccgaagactgcagaaggccctttgc 



ttggatctcttgagcctgtcagctgcatgtgatgccttggaccagcacaacctcaagcaaaatgaccagcccatggatat 



cctgcaganattaattgtttgaccactatttatgaccgcctggagcaagagcacaacaatttggtcaacgtccctctct 



gcgtggatatgtgtctgaactggctgctgaatgtttatgatacgggacgaacagggaggatccgtgtcctgtctmaaa 



actggcatcatttccctgtgtaaagcacatttggaagacaagtacagataccttttcaagcaagtggcaagttcaacagg 



attttgtgaccagcgcaggctgggcctccttctgcatgattcutccaaattccaagacagngggtgaagttgcatcct 



Ugggggcagtaacattgagccaagtgtccggagctgcttccaatttgctaataataagccagagatcgaagcggccctc 



ttcctagactggatgagactggaaccccagtccatggtgtggctgcccgtcctgcacagagtggctgctgcagaaactgc 



caagcatcaggccaaatgtaacatctgcaaagagtgtccaatcattggattcaggtacaggagtctaaagcactttaatt 



atgacatctgccaaagctgcmttttctggtcgagttgcaaaaggccauaaatgcaciatax^ggtggaatattgc 



03 

0 



actccgactacatcaggagaagatgncgagactttgccaagglaclaaaaaacaaatticgaaccaaaaggtattttgc 



gaagcatccccgaatgggctacctgccagtgcagactgtcttagagggggacaacatggaaactcccgacacaatgtag 



Seq. ID No 3: Minl-dystrophin A3849. It is a mini-dystrophin construct 



AttttcaCCatgGtttggtgggaagaagtagaggactgttatgaaagagaagatgttcaaaagaaaacattcacaaaatg 



ggtaaatgcacaattttctaagtttgggaagcagcatattgagaacctcttcagtgacctacaggatgggaggcgcctcc 

.in 

P tagacctcctcgaaggcctgacagggcaaaaactgccaaaagaaaaaggatccacaagagttcatgccctgaacaatgtc 
W 

s^j aacaaggcactgcgggttttgcagaacaataatgttgatttagtgaatattggaagtactgacatcgtagatggaaatca 

SJ 

M 

P taaactgactcttggtttgatttggaatataatcctccactggcaggtcaaaaatgtaatgaaaaatatcatggctggat . 



tgcaacaaaccaacagtgaaaagattctcctgagctgggtccgacaatcaacicgtaattatccacaggttaatgtaatc 



aacttcaccaccagctggtctgatggcctggctttgaatgctctcatccatagtcataggccagacctatttgactggaa 



tagtgtggtttgccagcagtcagccacacaacgactggaacatgcattcaacatcgccagatatcaattaggcatagaga 



aactactcgatcctgaagatgttgataccacctatccagataagaagtccatcttaatgtacatcacatcactcttccaa 



gttttgcctcaacaagtgagcaitgaagccatccaggaagtggaaatgttgccaaggccacctaaagtgactaaagaaga 



17 



. acattttcagttacatcatcaaatgcactattctcaacagatcacggtcagtctagcacagggatatgagagaacttctt 



cccctaagcctcgattcaagagctatgcctacacacaggctgcttatgtcaccacctctgaccctacacggagcccattt 



ccttcacagcatttggaagctcctgaagacaagtcatttggcagttcattgatggagagtgaagtaaacctggaccgtta 



tcaaacagctttagaagaagtattatcgtggcttctttctgctgaggacacangcaagcacaaggagagautctaatg 



atgtggaagtggtgaaagaccagtttcatactcatgaggggtacatgatggatttgacagcccatcagggccgggttggt 



aatattctacaattgggaagtaagctgattggaacaggaaaattatcagaagatgaagaaactgaagtacaagagcagat 



gaatctcctaaattqaagatgggaatgcctcagggtagctagcatggaaaaacaaagcaatttacatagagttttaatgg 



atctccagaatcagaaactgaaagagttgaatgactggctaacaaaaacagaagaaagaacaaggaaaatggaggaagag 



cctcttggacctgatcttgaagacctaaaacgccaagtacaacaacataaggtgcucaagaagatctagaacaagaaca 



agtcagggtcaattctctcactcacatggtggtggtagttgatgaatctagtggagatcacecaactgctgctttggaag 



aacaacttaaggtattgggagatcgatgggcaaacatctgtagatggacagaagaccgctgggttcttttacaagacact 



catagauactgcaacagttccccctggacctggaaaagtttcugcccggcctacagaagccgaaacaactgccaatgt 



cctacaggatgctacccgtaaggaaaggctcctagaagactccaagggagtaaaagagccgatgaaacaatggcaagacc 



tccaaggtgaaattgaagcteacacagatgtttatcacaacctgga^ 



ggttccgatgatgcagtcctgttacaaagacg^ 



caacattaggtccxatttggaagccagttctgacxagtggaagcgictgcacctttctctgcaggaacttctggtgtgg^ 



tacagctgaaagatgatgaattaagccggcaggcacctattggaggcgactticcagcagttcagaagcagaacgatgta 



catagggccrtcaagagggaattgaaaactaaagaacctgtaatcatgagtactcttgagactgtacgaatatttctgac 



□ agagcagcctttggaaggactagagaaactctaccaggagcccagagagctgcctcctgaggagagagcccagaatgtca 

p 
p 

SJ ctcggcttctacgaaagcaggctgaggaggtcaatactgagtgggaaaaattgaacctgcactccgctgactggcagaga 

P aaaatagatgagacccttgaaagactccaggaacttcaagaggccacggatgagctggacctcaagctgcgccaagctga 
CD 

P . ggtgatcaagggatcctggcagcccgtgggcgatctcctcattgactctctccaagatcacctcgagaaagtcaaggcac 

□ 

■ 

ttcgaggagaaattgcgcctctgaaagagaacgtgagccacgtcaatgaccttgctcgccagcttaccactttgggcatt 



cagctctcaccgtataacxicagcactctggaagacctgaacaccagatggaagcttctgcaggtggccgtcgaggaccg 



agtcaggcagctgcatgaagcwacagggacmggtccagcatctcagcactttctticcacgtctgtccagggtccct 



gggagagagccatctcgccaaacaaagtgccclaciatatcaaccacgagactcaaacaacttgctgggaccatccc 
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atgacagagctctaccagtctttagctgacctgaataatgtcagattctcagcttataggactgccatgaaactccgaag 



actgcagaaggccctttgcttggatctcttgagcctgtcagctgcatgtgatgccttggaccagcacaaccicaagcaaa 



atgaccagcccatggatatcctgcagattattaattgtttgaccactatttatgaccgcctggagcaagagcacaacaat 



ttggtcaacg^ccctctctgcgtggatatgtgtctgaactggctgctgaatgtttatgatacgggacgaacagggaggat 



ccgtgtcctgtcttttaaaactggcatcatttccctgtgtaaagcacatttggaagacaagtacagataccttncaagc 



aagtggcaagttcaacaggattttgtgaccagcgcaggctgggcctccttctgcatgattctatccaaattccaagacag 



ttgggtgaagttgcatccntgggggcagtaacattgagccaagtgtccggagctgcttccaatttgctaataataagcc 



agagatcgaagcggccctcttcctagactggatgagactggaaccccagtccatggtgtggctgcccgtcctgcacagag 



tggctgctgcagaaactgccaagcatcaggccaaatgtaacatctgcaaagagtgtccaatcattggattcaggtacagg 



agtctaaagcacttlaattatgacatctgccaaagctgctttttttctggtcgagttgcaaaaggccataaaatgcacta 



tcccatggtggaatattgcactccgactacatcaggagaagatgttcgagactttgccaaggiactaaaaaacaaatttc 



gaaccaaaaggtattttgcgaagcaiccccgaatgggctacctgccagtgcagactgtcttagagggggacaacatggaa 



actcccgacacaatgtag 



*>n 



e 
ru 



Seq. ID No 4: AAV-MCK-A4173. It is an AAV vector containing a MCK promoter driving a 
mini-dystrophin gene connected to a small poly A signal. 

TTGGCCACTCCCTCTCTC^ 
GCCCGACGCCCGGGCTITGCCCG 

GGCGGCCTCAGTGAGCGAGCGAGCGCGCAGAGAGGGAGTGGCCAACTCCATCACTA 
GGGGTTCCTAGATCTGAATTCGag 



cttgcatgcccaclacgggtctaggctgcccatgtaaggaggcaaggcctggggacacccgagatgcctggttataatta 



acccagacatgtggctgccccccx;ccccccaacacctgctgcctgagcctcacccccaccccggtgcctgggtcttaggc 



Si 
M 

S* tctgtacaccatggaggagaagctcgctctaaaaataaccctgtccctggtggatcccctgcatgcccaatcaaggctgt 

c 

jjj gggggactgagggcaggctgtaacaggcttgggggccagggcuatacgtgcctgggactcccaaagtattactgttcca 
? tgttcccggcgaagggccagctgtcccccgccagctagactcagcacttagtttaggaaccagtgagcaagtcagccctt 



ggggcagcccatacaaggccatggggctgggcaagctgcacgcctgggtccggggtgggcacggtgcccgggcaacgagc 



tgaaagctcatctgctctcaggggcccctccctggggacagcccctcctggctagtcacaccctgtaggctcctctatat 



aacccaggggcacaggggctgcccccgggtcacTCGAAttttcaCCatgGtttggtgggaagaagtagaggactgttatg 



aaagagaagalgttcaaaagaaaacattcacaaaatgggtaaatgcacaattttciaagUtgggaagcagcatattga 
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aacctcttcagtgacctacaggatgggaggcgcctcctagacctcctcgaaggcctgacagggcaaaaactgccaaaaga 

aaaaggatecacaagagttcatgccctgaacaatgtcaacaaggcactgcgggmtgcagaac 

tgaatattggaagtactgacatcgtagatggaaatcataaactgactcttggmgamggaatau 
caggtcaaaaatgtaatgaaaaatatcatggctggattgcaacaaaccaacagtgaaaagattctcctgagctgggtccg 

acaatcaactcgtaattatccaeaggt^^ 
tcatccatogtcataggccagaccumgactgg^ 

gcattcaacatcgccagatatcaattaggcatagagaaactactcgatcctgaagatgttgataccacctatccagataa 

gaagtccatcttaatg^atcacatcactcttccaagttttgcctcaacaagtgagcattgaagccatccagg 

aaatgttgccaaggccacctaaagtgactaaaga^ 

acggtcagtctagcacagggatatgagagaacttcttcxcctaagcctcgattcaagagctatgcctacacacaggctgc 
ttatgtcaccacctctgaccctacacggagcccatttccttcacagcatttggaagcrcctgaagacaagtcatttggca 
gticattgatggagagtgaagtaaacctggaccgttatcaaacagctttagaagaagtanatcgtggcttctttctgct 
gaggacacaltgcaagcacaaggagagatuctaatgatgiggaagtggtgaaagaccagtttcatactcatgaggggta 



catgatggatttgacagaxatcagggccgggttggtaatattciacaattgggaagtaagctgattggaacaggaaaat 



tatcagaagatgaagaaactgaagtacaagagcagatgaatctcctaaattcaagatgggaatgcctcagggtagctagc 



atggaaaaacaaagcaatttacatagagttttaatggatctccagaatcagaaactgaaagagttgaatgactggctaac 



aaaaacagaagaaagaacaaggaaaatggaggaagagcctcttggacctgatcttgaagacctaaaacgccaagtacaac 



aacataaggtgcttcaagaagatctagaacaagaacaagtcagggtcaattctctcactcacatggtggtggtagttgat 



Jjp gaatctagtggagatcacgcaactgctgctttggaagaacaacttaaggtattgggagatcgatgggcaaacatctgtag 

IV 
0 

^ atggacagaagaccgctgggttcttttacaagacatccttctcaaatggcaacgtcttactgaagaacagtgccttttta 
Si , 

• gtgcatggctttcagaaaaagaagatgcagtgaacaagattcacacaactggctttaaagatcaaaatgaaatgttatca 
W agtcttcaaaaactggccgttttaaaagcggatctagaaaagaaaaagcaatccatgggcaaactgtattcactcaaaca 



agatcttctttcaacactgaagaaiaagtcagtgacccagaagacggaagcatggctggataactttgcccggtgnggg 



ataatttagtccaaaaacttgaaaagagtacagcacagactcatagattactgcaacagttccccctggacctggaaaag 



uicttgcctggcttacagaagctgaaacaactgccaatgtcciacaggatgctacccgtaaggaaaggctcctagaaga 



ctccaagggagtaaaagagctgaigaaacaatggcaagacctccaaggtgaaattgaagctcacacagatgtttatcaca 



awtggatgaaaacagccaaaaaatcctgagatccctggaaggttccgatgatgcagtcctgttacaaagacgtttggat 



aacatgaacttcaagtggagtgaactteggam^gtctctcaacattaggtcccatttggaagccagttctgaccagtg 



gaagcgtctgcacctttctctgcaggaacttctggtgtggctacagctgaaagatgatgaattaagccggcaggcaccta 



ttggaggcgactttccagcagttcagaagcagaacgatgtacatagggccltcaagagggaattgaaaaciaaagaacct 



gtaatcatgagtactcttgagactgtacgaatatttctgacagagcagcctttggaaggactagagaaacta 

JHL 

£ gcccagagagctgcctcctgaggagagagcccagaatgteactcggcnctacgaaagcaggctgaggaggtcaauct^ 



O 

0 

SI 



0 



agtgggaaaaattgaacctgcactccgctgactggcagagaaaaatagatgagacccttgaaagactccaggaacttcaa 



P gaggccacggatgagctggacctcaagctgcgccaagctgaggtgatcaagggaicctggcagcccgtgggcgatctcct 

i 

5 cattgactctctccaagatcacctcgagaaagtcaaggcacttcgaggagaaattgcgcctctgaaagagaacgtgagcc 
P 



acgtcaatgaccttgctcgccagcttaccactttgggcattcagctctcaccgtataacctcagcactctggaagacctg 



aacaccagatggaagcttctgcaggtggccgtcgaggaccgagtcaggcagctgcatgaagcccacagggactttggtcc 



agcatctcagcactttctttccacgtctgtccagggtccctgggagagagccatctcgccaaacaaagtgccctactata 



tcaaccacgagactcaaacaacttgctgggaccatcccaaaatgacagagctciaccagtcttiagcigacctgaataat 
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gtcagattctcagcttataggactgccatgaaactccgaagactgcagaaggccctttgcttggatctcttgagcctgtc 
agctgcatgtgatgccttggaccagcacaacctcaagcaa^ 

tgaccacmttatgaccgcctggagcaagagcacaacaatttggtcaacgtccctctctgcgtggatatgtgtctgaac 

tggctgctgaatgtttatgatacgggacgaacagggaggatccgtgtcctgtcttttaaaactggcatcatttccctgtg 

taaagcacatttggaagacaagtacagataccttttcaagcaagtggcaagttcaacaggattttgtgaccagcgcaggc 

tgggcetccttctgcatgattctatccaaattccaagacagttgggtgaagttgcatcctttgggggcagtaacattgag 

ccaagtgtccggagctgcttccaamgctaataataagccagagatcgaagcggccctcttcctagactggatgagact 

ggaaccccagtccatggtgtggctgcccgtcctgcacagagtggctgctgcagaaactgccaagcatcaggccaaatgta 

acatctgcaaagagtgtccaatcattggattcaggtacaggagtctaaagcacttuattatgacatctgccaaagctgc 

tttttttctggtcgagttgcaaaaggccataaaatgcactatcccatggtggaatattgcactccgactacatcaggaga 

agatgttcgagactttgccaaggtactaaaaaacaaatttcgaaccaaaaggtattitgcgaagcatccccgaatgg^ct 

acctgccagtgcagactgtcttagagggggacaacatggaaactcccgacacaatgtagicgagaggcctaataaagagc 

tcagatgcatcgatcagagtgtgttggttttttgtgtgaGAT^ 
CCCTCTC 



TGCGCGCTCGCTCGCTCACTGAGGCCGCCCGOGCAAAGCCCGGGCGTCGGGCGACC 
TTTGGTCGCCCGGCCTCAGTGAGC 

GAGCGAGCGCGCAGAGAGGGAGTGGCCAA 

■ 

Seq. ID No 5: AAV-MCK-A3990. It is an AAV vector containing a MCK promoter driving a 
rnini-dystrophin gene connected to a small poly A signal 

TTGGCCACTCCCTCTCTGCGCGCTCGCrCGCT 
GCCCGACGCCCGGGCTTTGCCCG 

GGCGGCCTCAGTGAGCXjAGCGAGCGCGCAGAGAGGGAGTGGCCAACTCCATCACTA 
GGGGTTCCTAGATCTGAATTCGag 

cttgcatgcccactacgggtctaggctgcccatgtaaggaggcaaggcctggggacacccgagatgcctggttataatta 

acccagacatgtggctgcxcxcxcccccccaacacxtgctgcctgagcctcacccccaccccggtgcctg 

tctgtacaccatggaggagaagctcgctctaaaaataaccctgtccctggtggatcccctgcatgcccaatcaaggctgt 

gggggactgagggcaggctgtaacaggcttgggggccagggcttatacgtgcctgggactcccaaagtattactgttcca 

tg^tcccggcgaagggc<^gctgt<xcccgccagctagactcagcacttogmaggaaccagtgagcaagtcag 
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ggggcagcccatacaaggccatggggctgggcaagctgcacgcctgggtccggggtgggcacggtgcccgggcaacg 

tgaaagctcatctgctctcaggggcccctccctggggacagcccctcctggctagtcacaccctgtagg 
aacccaggggcacaggggetgcccccgggtcacTCGAAttttcaCCatgGtttggtgggaagaagtagaggactgttatg 
aaagagaagatgttcaaaagaaaacattcacaaaatgggtaaatgcacaattttctaagtttgggaagcagcatattgag 
aacctcttcagtgacctacaggatgggaggcgcctcctagacctcctcgaaggcctgacagggcaaaaactgccaaaaga 

jjjj aaaaggatccacaagagttcatgccctgaacaatgtcaacaaggcactgcgggimgcagaacaataatp 
IU 

^ tgaatattggaagtactgacatcgtagatggaaatcat^ 
M 



caggtcaaaaatgtaatgaaaaatatcatggctggattgcaacaaaccaacagtgaaaagattctcctgagctgggtccg 



W acaatcaactcgtaattatccacaggttaatgtaatcaacttcaccaccagctggtctgatggcctggctttgaatgctc 
tcatccatagtcataggccagacctaittgactggaatagtgtggtttgccagcagtcagccacacaacgactggaacat 



aa 



gcattcaacatcgccagatatcaattaggcatagagaaactactcgatcctgaagatgttgataccacctatccagat 



gaagtccatcttaatgtacatcacatcactcttccaagttttgcctcaacaagtgagcattgaagccatccaggaagtgg 



aaatgttgccaaggccacctaaagtgactaaagaagaacattttcagttacatcatcaaatgcactattctcaacagatc 



n 



acggtcagtctagcacagggatatgagagaacttcttcccctaagcctcgattcaagagctatgcctacacacaggctgc 
ttatgtcaccacctctga(xctocacggagcccatttecttcacagcatttggaagctcctgaagacaagtcatttggca 

gttcattgatggagagtgaagtaaacctggaccgttatcaaacagctttagaagaagtattatcgtggcttctttctgct , 
gaggacacattgcaagcacaaggagagamctaatgatgtggaag^ggtgaaagaccagtttcaiactcatgaggggta 

catgatggatttgacagcccatcagggccgggpggtaate^ 
Utcagaagatgaagaaactgaagtacaagagcagatgaat^ 

atggaaaaacaaagcaatttacatagagttttaatggatctccagaatcagaaactgaaagagttgaatgactggctaac 

aaaaacagaagaaagaacaaggaaaatggaggaagagcQtcttggacctgatcttgaagacctaaaacgccaagtacaac 

aacataaggtgcttcaagaagatctagaacaagaacaagtcagggicaattctctcactcacatggtggtggiagttgai 

gaatctagtggagatcacgcaactgctgctttggaagaacaacttaaggtattgggagatcgatgggcaaacatctgtag 

atggacagaagaccgctgggttcttttacaagaccagcctgacctagctcctggactgaccactattggagcctctccta 

ctcagactgttactctggtgacacaacctgtggttactaaggaaactgccatctccaaactagaaatgccatcuccug 

atgttggaggtacctactcatagattactgcaacagtt^ 



tgaaacaactgccaatgtcctacaggatgctacccgtaaggaaaggctcctagaagactccaagggagtaaaagagctga 



ccaaaaa 



tgaaacaatggcaagacctccaaggtgaaattgaagctcacacagatgmatcacaacctggatgaaaacag 



atcctgagatccctggaaggttccgatgatgcagtcctgttacaaagacgmggataacatgaacttcaagtggagtga 
acttcggaaaaagtctctcaacattaggtcccatttggaagccagttctgaccaglggaagcgtctgcaccntctctgc 

■ 

aggaacttctggtgtggctacagctgaaagatgatgaattaagccggcaggcacctattggaggcgactttccagcagtl 

tp 

9 cagaagcagaacgatgtacatagggccttcaagagggaattgaaaactaaagaacctgtaatcatgagtactcttgagac 

.m 

P 
P 

^ tgtacgaatatttctgacagagcagcctttggaaggactagagaaactctaccaggagcccagagagctgcctcctgagg 

□ agagagcccagaatgtcactcggcttctacgaaagcaggctgaggaggtcaatactgagtgggaaaaattgaacctgcac 
S 

g tccgctgactggcagagaaaaatagatgagacccttgaaagactccaggaacttcaagaggecacggatgagctggacct 
caagctgcgccaagctgaggtgatcaagggatcctggcagcccgigggcgatctcctcattgactctctccaagatcacc 
tcgagaaagtcaaggcacttcgaggagaaattgcgcctctgaaagagaacgtgagccacgtcaatgaccngctcgccag 
cttaccactttgggcattcagctctcacxgtataacctcagcactctggaagacctgaacaccagatggaagcttctgca 
ggtggccgtcgaggaccgagtcaggcagctgcatgaagcccacagggactttggtccagcatctcagcactttctucca 



cgtctgtccagggtccctgggagaga 
tgctgggaccatcccaaaatgacagagctcta^^ 

tgcxatgaaactccgaagactgcagaaggccctttgcttggatctcttgagcctgtcagctgcatgtga^ 
agcacaacctcaagcaaaatgaccagcccatggatatcctgcagattattaattgtttgaccactatttatgaccgcctg 

« 

gagcaagagcacaacaatttggtcaacgtccctctctgcgtggatatgtgtctgaactggctgctgaatgtuatgatac 

gggacgaacagggaggatccgtgtcctgtctmaaactggcatcatttccctgtgtaaagcacatttggaagacaag^ 

acagatarcttttcaagcaagtggcaagttcaacaggatutgtgaccagcgcaggctgggcctccuctgcatgam 

atccaaattccaagacagttgggtgaagttgcatcctttgggggcagtaacattgagccaagtgtccggagetgcncca 

atttgctaataataagccagagatcgaagcggccctcttcctagactggatgagactggaaccccagtccatggtgtggc 

tgcccgtcctgcacagagtggctgctgcagaaactgccaagcatcaggccaaatgtaacatctgcaaagagtgtccaatc 

attggattcaggtacaggaglctaaagcactttaattatgacatctgccaaagctgctttttttctggtcgagttgcaaa 

aggccataaaatgcactatcccatggtggaatattgcactccgactacatcaggagaagatgttcgagactttgccaagg 

tactaaaaaacaaamcgaaccaaaaggtatmgcgaagcatara^ 



gagggggacaacatggaaactcxcgacacaatgtagtcgagaggcctaataaagagctcagatgcatcgatcagagtgtg 



ttggttttttgtgtgaGATCTAGGAACCCGTAGTGATC 
CGCTCGCTCACTGAG 



GCCGCCCGGGGAAAGCCCGGGCGTCGGGCGACCTTTGGTCGCCCGGCCTCAGTGAG 
CGAGCGAGCGCGCAGAGAGGGAGT 



GGCCAA 

§ Seq! ID No 6: AAV^MCK-A3849. It is an AAV vector containing a MCK promoter driving a 
g mini-dystrophin gene connected to a small polyA signal 

M 

TTGGCCACTCCCIXnXTTGC 
^ GCCCGACGCCCGGGCTTTGCCCG 

W GGCGGCCTCAGTGAGCGAGCGAGCGCGCAGAGAGGGAGTGGCCAACTCCATCACTA 

(3 

$ GGGGTTCCTAGATCTGAATTCGag 

cttgcatgcccactacgggtctaggctgcccatgtaaggaggcaaggcctgggeacacccgagatgcctggttataatta 

acccagacatgtggctgwxcccc<xx;crc^ 



tctgtacac^aiggaggagaagctcgctcUaaaataaccctgtccctggtggatcccctgcatgcccaatcaaggctgt 



gggggactgagggcaggctgtaacaggcttgggggccagggcttat^ 
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tgttcccggcgaagggccagctgtcccccgccagctagactcagcacttagtttaggaaccagtgagcaagtcagccctt 
ggggcagcccatacaaggccatggggctgggca^gctgcacgcctgggtccggggtgggcacggtgcccgggcaacgagc 

tgaaagctcatctgctctcaggggcccctcxctggggaeagc^^ 

aacccaggggcacaggggctgcccccgggtcacTCGAAttttcaCCatgGtttggtgggaagaagtagaggactgtta 

aaagagaagatgttcaaaagaaaacattcacaaaatgggtaaatgcacaattttctaagtttgggaagcagcatattgag 

aacctcttcagtgacetacaggatgggaggcgcctcctagacctcctcgaaggcctgacagggcaaaaactgccaaaaga 

aaaaggatccacaagagttcatgccctgaacaatgtcaacaaggcactgcgggttttgcagaacaataatgttgatttag 

tgaatattggaagtactgacatcgtagatggaaatcataaactgactcttggtttgatttggaatataatcctccactgg 

caggtcaaaaatgtaatgaaaaaUtcatggctggattgcaacaaarcaac^ 
acaatcaactcgtaattatccacaggttaatgtaatcaacttcaccaccagctggtctgatggcctggctttgaatgctc 

teatccatagtcataggccagacctatttgactggaatagtgtggm 

gcattcaacatcgccagautcaattaggcatagagaaactactcgatcxtgaagatgttgataccacctatccagataa 
gaagtccaxtuatgtacatcacaicactcttcc^ 



aaatgttgccaaggccacctaaagtgactaaagaagaacattttcagttacatcatcaaatgcactattctcaacagatc 

acggtcagtctagcacagggatatgagagaacttcttcccctaagcctcgattcaagagctatgcctacacacaggctgc 

ttatgtcaccacctctgaccctacacggagcccatttccttcacagcatttggaagctcxtgaagacaagtcam 

gttcattgatggagagtgaagtaaacctggaccgttatcaaacagctttagaagaagtattatcgtggcttctttctgct 

gaggacacattgcaagcacaaggagagatttctaatgatgtggaagtggtgaaagaccagtttcatactcatgaggggta 

catgatggamgacagcccatcagggccgggttggtaatanctacaattgggaagtaagctgattggaacaggaaaat 

tatcagaagatgaagaaactgaagtacaagagcagatgaatct(xtaaattcaagatgggaatgcctcagggtagctagc 

atggaaaaacaaagcaatttacatagagttttaatggatctccagaatcagaaactgaaagagttgaatgactggctaac 

aaaaacagaagaaagaacaaggaaaatggaggaagagcctcttggacctgatcttgaagacctaaaacgccaagtacaac 

aacataaggtgcttcaagaagatctagaacaagaacaagtcagggtcaattctctcactcacatggtggtggtagttgat 

gaatctagtggagatcacgcaactgctgctttggaagaacaactiaaggtattgggagatcgatgggcaaacatctgtag 

atggacagaagaccgctgggitcnttacaagacactcatagaitactgcaacagttccccctggacctggaaaagtttc 

ttgcctggcttacagaagctgaaacaaagccaatgtcctacaggatgctacccgtaaggaaaggctcctagaagactcc 



aagggagtaaaagagctgatgaaacaatggcaagarcfc^ 



ggatgaaaacagccaaaaaatcctgagatccctggaaggttccgatgatgcagtcct^ttacaaagacgmg 



tgaacttcaagtggagtgaacttcggaaaaagtctctcaacattaggtcccatttggaagccagttctgaccagtggaag^ 



cgtctgcacctttctctgcaggaacttctggtgtggctacagctgaaagatgatgaattaagccggcaggcacctattgg 



aggcgactttccagcagttcagaagcagaacgatglacatagggccttcaagagggaattgaaaactaaagaacctgtaa 

CP 

P tcatgagtactcttgagactgtacgaammctgacagagcagcc 
fil 

o 

0 

VI agagagctgcctcctgaggagagagcccagaatgtcactcggcttctacgaaagcaggctgaggsggtcaatactgagtg 
SJ 



CO 



CJ ggaaaaattgaacctgcactccgctgactggcagagaaaaatagatgagacccttgaaagactccaggaacttcaagagg 

t. 

ccacggatgagctggacctcaagctgcgccaagctgaggtgatcaaggga^tggcagcccgtgggcgatctcctcatt 
gactctctccaagatcacctcgagaaagicaaggcacttcgaggagaaattgcgcctctgaaagagaacgtgagccacgt 



caatgaccttgctcgccagctmccactttgggcattcagctctcaccgtataacctcagcactctggaagacctgaaca 



ccagatggaagcttctgcaggtggccgtcgaggaccgagtcaggcagctgcatgaagcccacagggactttggtccagca 



tctcagcactttctttccacgtetgtccagggfccctggga 



ccacgagactcaaacaacttgctgggaccatcccaaaatgacagagctctaccagtctttagctgacctgaataatgtca 



gattctcagcttataggactgccatgaaactccgaagactgcagaaggccctttgcttggatctcttgagcctgtcagct 



gcatgtgatgccttggaccagcacaacctcaagcaaaatgaccagcccatggatatcctgcagattattaattgtttgac 



cactatttatgaccgcctggagcaagagcacaac^^ 



tgctgaatgtttatgatacgggacgaacagggaggatccgtgtcctgtcttttaaaactggcatcatttccctgtgtaaa 



gcacatttggaagacaagtacagataccttttcaagcaagtggcaagttcaacaggamtgtgaccagcgcaggctggg 



Si cctccttctgcatgattctatccaaattccaagacagttgggtgaagctgcatcctttgggggcagtaacattgagccaa 

Si 

a 

jp gtgtccggagctgcttccaatttgctaataataagccagagatcgaagcggcccicttcctagactggatgagactggaa 



05 

P ccccagtccatjggtgtggctgcccgtcctgcacagagtggctgctgcagaaactgccaagcatcaggccaaatgtaacat 



ctgcaaagagtgtccaatcattggattcaggtacaggagtciaaagcactttaanatgacatctgccaaagctgctttt 



tttctggtcgagttgcaaaaggccataaaatgcactatcccatggtggaatactgcaciccgaciacatcaggagaagat 



gttcgagactttgccaaggtactaaaaaacaaatttcgaaccaaaaggtaitttgcgaagcatccccgaatgggctacct 



gccag^gcagactgtctUgagggggacaacatggaaactcccgacacaatgiagicgagaggcctaataaagagctcag 



atgcatcgatcagagtgtgtt, 
TCTCTGCG 



ggttttt^tgtgaGATCTAGGAACCCCTAGTGATGGAGTTGGCCACTCCC 



CCGCCGGGGCAAAGCCCGGGCGTCGGGCGACCTTTG 



CGCTCGCTCGCTCACTGAGG 
GTCGCCCGGCCTCAGTGAGCGAGC 

GAGCGCGCAGAGAGGGAGTGGCCAA 



Seq. ID No 7: AAV-CMV-A3990. It is an AAV vector containing a CMV promoter driving a 
mini-dystrophin gene connected to a small poly A signal 

CP 
D 

© TTGGCCACTCCCTCTCTGCGCGCrCGCTCGCTCACTGAGGCCGGGCGACCAAAGGTC 

0 

SJ GCCCGACGCCCGGGCTTTGCCCG 
M 

| GGCGGCCTCAGTGAGCGAGCGAGCGCGCAGAGAGGGAGTGGCCAACTCCATCACTA 

fU GGGGTTCCT AGATCTGA ATTCGGT 
00 

0 ACCCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCG 
CCTATTGACGTCAATAATGACGT 



ATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAGTATT 
TACGGTAAACTGCCCACTTGGCA 

GTACATCAAGTGTATCATATGCCAAGTACGCCCCXrrATTGACGTCAATGACGGTAAA 
TGGCCCGCCTGGCATTATGCCCA 



36 



GTACATGACCITATGGGA 
ATTACCATGGTGATGCGGTTTT 



GGCAGTACATGAATGGGCGTGGATAGCGGTTTGACTCACGGGGATTTCCAAGTCT 
ACCCCATTGACGTCAATGGGAGT 



TTOTmGGCACCAAAATCAACGGGACTITCCAAAATGTCGTAACAACTCCGCCCCA 
TTGACGCAAATGGGCGGTAGGCG 



TGTACGGTGGGAGGTCTATATAAGCAGAGCTCGTTTAGTGAACCGTCAGATCGCCTG 
§ GAGACGCCATCCACGCTGTTTTG 

0 

5 

VI ACCTCCATAGAAGACACCGGGACCGATCCAGCCTCCGGACTCTAGAGGATCCGGTA 
M CtcgaAttttcaCCatgGtttggt 



W gggaagaagiagaggactgttatgaaagagaagatgttcaaaagaaaacattcacaaaatgggtaaatgcacaattttct 
0 

aagtttgggaagcagcatattgagaacctcttcagtgacctacaggatgggaggcgcctcctagacctcctcgaaggcct 



gacagggcaaaaactgccaaaagaaaaaggatccacaagagttcatgccctgaacaatgtcaacaaggcactgcgggttt 



tgcagaacaataatgngatttagtgaatattggaagtactgacatcgtagatggaaaicataaactgactcttggtttg 



atttggaatataatcctccaciggcaggtcaaaaatgtaa^ 



aaagattctmgagctgggtccgacaatcaacfcg^ 



ctgatggcctggctttgaatgctctcatccatagtcataggccagacctatttgactggaatagtgtggtttgccagcag 



tcagccacacaacgactggaacatgcattcaacatcgccagatatcaattaggcatagagaaactactcgatcctgaaga 



tgttgataccacctatccagataagaagtccatcttaatgtacatcacatcaclcttccaagttttgcctcaacaagtga 



gcattgaagccatccaggaagtggaaatgttgccaaggccacctaaagtgactaaagaagaacattttcagttacatcat 



caaatgcactattctcaacagatcacggtcagtctagcacagggatatgagagaacttcttcccctaagcctcgattcaa 



w gagctatgcctacacacaggctgcttatgtcaccacctctgaccctacacggagcccatttccttcacagcatttggaag 

[V 

P 

P 

Si ctcctgaagacaagtcatt^gcagttcattgatggagagtgaagtaaacctggaccgltatcaaacagctttagaagaa 

1 

gtattatcgtggGttctttctgctgaggacacattgcaagcacaaggagagatttctaatgatgtggaagtggtgaaaga 



03 

gj ccagtttcatactcatgaggggtacatgatggatttgacagcccatcagggccgggttggtaatattctacaattgggaa 

o 



gtaagctgattggaacaggaaaattatcagaagatgaagaaactgaagtacaagagcagatgaatctcctaaattcaaga 



tgggaatgcctcagggtagctagcatggaaaaacaaagcaatttacatagagttttaatggatctccagaatcagaaact 



gaaagagttgaatgactggctaacaaaaacagaagaaagaacaaggaaaatggaggaagagcctcttggacctgatcttg 



aagacctaaaacgccaagtacaacaacataaggtgcttcaagaagatctagaacaagaacaagtcagggtcaattctctc 



actcacatggt 



ggtggtasttgatgaatctagtggagatcacgcaactgctgctttggaagaacaacttaaggtattggg 



agatcgatgggcaaacatctgtagatggacagaagaccgctgggnctmacaagaccagcctgaccta 

tgaccactattggagcctctcctactcaBactgttactctggtgacacaacctgtggttaw 
aaactagaaatgccatct^ttgatgttggaggtacctactcatagatmctgcaacagnecccctggacctggaaaa 

gtttcttgcctggcttacagaagctgaaacaactgccaatgtccucaggatgctacccgtaaggaaaggctw 

0 actccaagggagtaaaagagclgatgaaacaatggcaagacctccaaggtgaaattgaagctcacacagatgtttatcac 

1 aacctggatgaaaacagccaaaaaatcctgagatccctggaaggttccgatgatgcag^ 



.to 



taacatgaacttcaagtggagtgaacttcggaaaaagtctctcaacattaggtcccatnggaagccagnctgaccagt 



ggaagcgtctgcacctttctctgcaggaacttctggtgtggctacagctgaaagatgatgaattaagccggcaggcacct 



ittggaggcgactttccagcagttcagaagcagaacgatgtacaiagggccttcaagagggaattgaaaactaaagaacc 



tgtaatcatgagtactcttgagactgtacgaatatttctgacagagcagcctttggaaggaciagagaaactctaccagg 



agcccagagagctgcctcctgaggagagagcccagaatgtcactcggcttctacgaaagcaggctgaggaggtcaataot 



gagtgggaaaaattgaacctgcactccgctgactggcagagaaaaatagatgagacccttgaaagactccaggaacttca 



0J 



agaggccacggatgagctggacctcaagctgcgccaagctgaggtgatcaagggatcctggcagcccgtgggcgatctcc 



tcattgactctctccaagatcacctcgagaaagtcaaggcacttcgaggagaaattgcgcctctgaaagagaacgtgagc 



cacgtcaatgaccttgctcgccagcttaccactttgggcattcagctctcaccgtataacctcagcactctggaagacct* 



gaacaccagatggaagcttctgcaggtggccgtcgaggaccgagtcaggcagctgcatgaagcccacagggactttggtc 



cagcatctcagcactttctttccacgtctgiccagggtccctgggagagagccatctcgccaaacaaagtgccctactat 



atcaaccacgagactcaaacaacttgctgggaccatcccaaaatgacagagctctaccagtctttagctgacctgaataa 



/*j tgtcagattctcagctutaggactgccatgaaactccgaagactgcagaaggccctttgcttggatctcttgagcctgt 

® cagctgcatgtgatgccttggaccagcacaacctcaagcaaaatgaccagcccatggatatcctgcagattattaattgt 
ttgaccactatttatgaccgcctggagcaagagcacaacaatttggtcaacgtccctctctgcgtggatatgtgtctgaa 



ctggctgctgaatgtttatgatacgggacgaacagggaggatccgtgtcctgtctttlaaaactggcatcatttccctgt 



giaaagcacantggaagacaagtacagaiaccttttcaagcaagtggcaagttcaacaggattttgtgaccagcgcagg 



ctgggcctccnctgcatgattctatccaaaticcaagacagttgggtgaagttgcatcctttgggggcagtaacattga 



gccaag^gtcxggagccgcttccaamgctaamtaagccagagatcgaagcggccctcttcctagactggatgagac 



tggaac^cagtccatggtgtggctgw^ 

aacatctgcaaagagtgtccaatcattggattc * 

' * ' * ■ * * 

ctttmtctggtcgagttgcaaaaggcca^ 

aagatgttcgagactttgccaagg^actaaaaaacaaatttcgaaccaaaaggtattttgcgaagcalccccgaatgggc 

* 

tacctgccagtgcagactgtctlagagggggacaacatggaaactcccgacaGaatgmgtcgagaggccUataaagag 

ctcagatgcatcgatcagagtgtgttggtmttgtgtgaGATCTAGGAACCCCTAGTGATGGAGTTGG 
CCCTCT 

i 

ctgcck:gctcgctcgctcactgaggccgcccgg^ 
ctttggtcgcccggcctcagtgag 

CGAGCGAGCGCGCAGAGAGGGAGTGGCCAA 

Seq. ID No 8: AAV-CMV-A3849. It is an AAV vector containing a CMV promoter driving a 
mini-dystrophin gene connected to a small polyA signal 

TGGCCACTCCCTCTCTGCGCG 
CCCGACGCCCGGGCTTTGCCCGG 

GCGGCCTCAGTGAGCGACTCGAGCGCGCAGAGAGGGAGTGGCCAACTCCATCACTAG 
GGGTrcCTAGATCTGAATTCGGTA 



CCCGTTACATAACTTACGGTAA^^ 
CCATTGACGTCAATAATGACGTA 

TGTTCXCATAGTAACGCCAATAGGGACTITCCATTGACGTCAATGGGTGGAGTATTT 
ACGGTAAACTGCCCACTTGGCAG 

* - 

TACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAAT 
GGCCCGCCTGGCATTATGCCCAG 

TACATGACCTTATGGGACTTTCCTACTO 
TTACCATGGTGATGCGGTnTG 

GCAGTACATCAATCHjGCGTGGATAGCGGTTTGACTCACGCHjGATTTGCAAGTCTCCA > 
CCCCATTGACGTCAATGGGAGTT 

TGTTTTGGCACCAAAATCAACGGGACTTTCCAAAATGTCGTAACAACTCCGCCCCAT 
TGACGCAAATGGGCGGTAGGCGT 

GTACGGTGGGAGGTCTATATAAGCAGAGCTCGTTTAGTGAACCGTCAGATCGCCTGG 
AGACGCCATCCACGCTGTTTTGA 

CCTCCATAGAAGACACCGGGACCGATCCAGCCTCCGGACTCTAGAGGATCCGGTACt 
cgaAttttcaCCatgGtttggtg 



ggaagaagtagaggactgttatgaaagagaagatgttcaaaagaaaacattcacaaaatgggtaaatgcacaatcttcta 



agmgggaagcagcatattgaga^ 
acagggcaaaaactgccaaaagaaaaaggatccacaagagtt^ 

gcagaacaataatgttgatttagtgaatattggaagtactgacatcgtagatggaaatcataaactgactcttggtttga 

tttggaatataatcctccactggcaggtcaaaaatgtaatgaaaaatatcatggctggattgcaacaaaccaacagtgaa 

aagattctcctgagctgggtccgacaatcaactcgtaattatccacaggtuatgtaatcaacttcaccaccagctggtc 

tgatggcctggctttgaatgctctcatc^ 

cagccacacaacgactggaacatgcattcaacatogccagatatcaauaggcatagagaaactactcgatcctgaagat 

gttgataccacctatccagataagaagtccatcttaatgtacatcacatcactcnccaagnttgcctcaacaagtgag 

cattgaagccatccaggaagtggaaatgttgccaaggccacctaaagtgactaaagaagaacattucagttacatcatc 

aaatgcactattctcaacagatcacggtcagtctagcacagggatatgagagaacttcttcccctaagcctcgattcaag 

agctatgcctacacacaggctgcttatgtcaccacctcLgaccctacacggagcccatttccttcacagcatttggaagc 

tcctgaagacaagtcatttggcagttcattgatggagagtgaagtaaacctggaccgttatcaaacagcmagaagaag 

latiatcgtggcttctttctgctgaggac^angcaagcacaaggagagautctaatgatgiggaagtggtg 



cagtttcatactcatgaggggtacatgatggatttgacagcccatcagggccgggttggtaatattctacaattgggaag 



taagctgattggaacaggaaaattatcagaagatgaagaaactgaagtacaagagcagatgaatctcctaaattcaagat 



gggaatgcctcagggtagctagcatggaaaaacaaagcaatttacatagagttliaatggatctccagaatcagaaactg ( 



aaagagttgaatgactggctaacaaaaacagaagaaagaacaaggaaaatggaggaagagcctcttggacctgatcttga 



agacctaaaacgccaagtacaacaacataaggtgcttcaagaagatctagaacaagaacaagtcagggtcaattctctca 



£ ctcacatggtggtggtagttgatgaatctaglggagatcacgcaactgctgctttggaagaacaacttaaggtattggga 

ru 

jjj gatcgatgggcaaacatctgtagatggacagaagarcgct^ 

si 

* ccccctggacctggaaaag^ttgcctggcttac 



tP aggaaaggctcctagaagactccaagggagtaaaagagctgatgaaacaatggcaagacctccaaggtgaaattgaagct 
0 

0 

cacacagatgtttatcacaacctggatgaaaacagccaaaaaatcctgagatccctggaaggttccgatgatgcagtcct 



gttacaaagacgtttggataacatgaacttcaagtggaglgaacttcggaaaaagtcictcaacattaggtcccatttgg 



aagccagUctgaccagtggaagcgtctgcacctttctctgcaggaacttctggtgtggctacagctgaaagatgatgaa 



naagccggcaggcacctattggaggcgactttccagcagttcagaagcagaacgatgtacatagggccttcaagaggga 



AA 



attgaaaactaaagaacctgtaatcatgagtactcttgagactgcacgaatatttctgacagagcagcctttggaaggac 



tagagaaactctaccaggagcccagagagctgcctcctgaggagagagcccagaatgtcactcggcttctacgaaagcag 



gctgaggaggtcaatactgagtgggaaaaattgaacctgcactccgctgactggcagagaaaaatagatgagacccttga 



aagactccaggaacttcaagaggccacggatgagctggacctcaagctgcgccaagctgaggtgatcaagggatcctggc 



agcccgtgggcgatctcctcattgactctctccaagatcacctcgagaaagtcaaggcacttcgaggagaaattgcgcct 



£ etgaaagagaacgtgagccacgtcaatgaccttgctcgccagcttaccactttgggcattcagclctcaccgtataacct 



0 

0 cagcactctggaagacctgaacaccagatggaagcttctgcaggtggccgtcgaggaccgagtcaggcagctgcatgaag 

1 cccacagggactttggtccagcatctcagcactttctttccacgtctgtccagggtccctgggagagagccatctcgcca 
0 

i 

00 aacaaagtgccctactatatcaaccacgagactcaaacaacngctgggaccatcccaaaatgacagagctctaccagtc 



+ • 

tttagctgacctgaataatgtcagattctcagcttataggactgccatgaaactccgaagactgcagaaggccctttgct 



tggatcicttgagcctgtcagctgcatgtgatgccttggaccagcacaacctcaagcaaaatgaccagcccatggatatc 



ctgcagattattaattgmgaccactatttatgaccgcctggagcaagagcacaacaatttggicaacgtccctctctg 



cgtggatatgtgtctgaactggctgctgaatgtttatgatacgggacgaacagggaggatccgtgtcctgtcttttaaaa 



ctggcatcatttccctgtgtaaagcacatttggaagacaagtacagataccttttcaagcaagtggcaagttcaacagga 



ttngtgaccagcgcaggctgggcctccttctgcatgattctatccaaattccaagacagttgggtgaagugcaicctt 



tgggggcagtaacattgagccaagtgtccggagctgcttccaatttgctaataataagccagagatcgaagcggccctct 



tcctagactggatgagactggaaccccagtccatggtgtggctgcccgtcctgcacagagtggctgctgcagaaactgcc 



aagcatcaggccaaatgtaacatctgcaaagagtgiccaatcattggattcaggtacaggagtctaaagcactttaana 



tgacatctgccaaagctgctmtttctggtcgagttgcaaaaggccataaaatgcactatcccatggtggaatattgca 

ru 

0 ctccgactacatcaggagaagatgttcgagactttgccaaggtactaaaaaacaaatttcgaaccaaaaggtattttgcg 
Si 

• kagcatccccgaatgggctacctgccagtgcagactgtcttagagggggacaacatggaaactcccgacacaatgtagtc 

£P * gagaggcctaataaagagctcagatgcatcgatcagagtgtgttg 



GAGTTGGCCACTCCCTCTCTGCGCGCT^ 
CCCGGGCGTCGGGCGACCTTTGG 



TCGCCCGGCCTCAGTGAGCGAGCGAGCGCGCAGAGAGGGAGTGGCCAA 



46 



Seq. ID No 9: AAV-E-CMV-A3849: It is an AAV vector containing a MCK enhancer with a 
CMV promoter driving a mini-dystrophin gene connected to a small polyA signal 



TTGGCCACTCCCrCTCTGCGCGCTCGCTCGCTCACTGAGGCCGGGCGACCAAAGGTC 
GCCCGACGCCCGGGCTTTGCCCG 

GGCGGCCTCAGTGAGCGAGCGAGCGCGCAGAGAGGGAGTGGCCAACTCCATCACTA 
GGGGTTCCTAGATCTGAATTCGGT 

s 

W ACcactacgggtctaggctgcccatgtaaggaggcaaggcctggggacacccgagatgcctggttataattaacccagac 

SI , 

atgtggctgccctxccccccccaa^^ 



P 

CO TGGCC 



P 



ccatggaggagaagctcgctctmaataaccct^ 



cgcctggctgaccgcccaacgaccccggcccattgacgtcaAtaatgacgtatgttc 
ccatagtaacgccaatagggact 

ttccattgacgtcaatgggtggagtatttacggtaaactgcccacttggcagtacat 
caagtgtatcatatgccaagtac 

GCCCCCrATTGACC?TCAATOACGGTAAATQGCCCOCCTGGCATTATOCCCAGTACAT 
GACCTTATGGGACTTTCCTACTT 



47 



GGCAGTACATCTACGTATTAGTCATCGCTATTACCATGGTGATGCGGTTTTGGCAGT 
ACATCAATGGGCGTGGATAGCGG 



TTTGACTCACGGGGATTTCCAAGTCTCGACCCCATTC 
GGCACCAAAATCAACGGGACTT 



TCCAAAATGTCGTAACAACTCCGCCCCATTGACGCAAATGGGCGGTAGGCGTGTAC 
GGTGGGAGGTCTATATAAGCAGAG 



CTCGTTTAGTGAACCGTCAGATCGCCTGGAGACGCCATCCACGCTGTTTTGACCTCC 
^ ATAGAAGACACCGGGACCGATCC 

^ AGCCTCCGGACTCTAGAGGATCCGGTAQcgaAt^^ 
J|j gaaaga 



gaajatgttcaaaagaaaacattcacaaaatgggtaaatgcacaattttctaa^tttgggaagcagcatattgagaacct 
IP 

0 

& cttcagtgacctacaggatgggaggcgcctcctagacctcctcgaaggcctgacagggcaaaaactgccaaaagaaaaag 



gatccacaagagttcatgccctgaacaatgtcaacaaggcactgcgggttttgcagaacaataatgttgattlagtgaat 



attggaagmctgacatcgtagatggaaatcataaactgactcttggtttgatttggaatataatcctccactggcaggt 



caaaaatgtaatgaaaaatatcatggaggattgcaacaaaccaacag^gaaaagattctcctgagctgggtccgacaat 



caactcgtaaaatccacaggttaatgtaatcaaatc.acc 



48 



catagtcataggccagacctatttgactggaatagtgtggtttgccagcagtcagccacacaacgactggaacatgcatt 



caacatcgccagatatcaattaggcatagagaaactactcgatcctgaagatgttgataccacctatccagataagaagt 



ccatcttaatgtacatcacatcactcttccaagttttgcctcaacaagtgagcattgaagccatccaggaagtggaaatg 



ttgccaaggccacctaaagtgaciaaagaagaacattttcagttacatcatcaaatgcactattctcaacagatcacggt 



cagtctagcacagggatatgagagaacttcttcccctaagcctcgattcaagagctatgcctacacacaggctgcttatg 



jj£ tcaccacctctgaccctacacggagcccatttccttcacagcatttggaagctcctgaagacaagtcatuggcagttca 
ttgatggagagtgaagtaaacctggaccgttatcaaacagctttagaagaagtattatcgtggcttctttctgctgagga 

SI 

M 

a cacangcaagcacaaggagagatttctaatgatgtggaagtggtgaaagaccagtttcatactcatgaggggtacatga 



qj tggatttgacagcccatcagggccgggttggtaatattctacaattgggaaguagctgattggaacaggaaaattatca 

0 

0 

gaagatgaagaaactgaag^acaagagcagatgaacctcctaaattcaagatgggaatgcctcagggtagctagcatgga 



aaaacaaagcaamacatagagttttaatggatctccagaatcagaaactgaaagagttgaatgactggctaacaaaaa 



cagaagaaagaacaaggaaaatggaggaagagcctcaggacctgatcttgaagacctaaaacgccaagtacaacaacat 



aaggtgcttcaagaagatctagaacaagaacaagtcagggtcaattctctcactcacatggtggtggtagttgatgaatc 



49 



tagtggagatcacgcaactgctgctttggaagaacaacttaaggtattgggagatcgatgggcaaacatctgtagatgga 



cagaagaccgctgggttcttttacaagacactcatagattactgcaacagttccccctggacctggaaaagtttcttgcc 



tggcttacagaagctgaaacaactgccaatgtcctacaggatgctacccgtaaggaaaggctcctagaagactccaaggg 



agtaaaagagctgatgaaacaatggcaagacctccaaggtgaaattgaagctcacacagatgtttatcacaacctggatg 



aaaacagccaaaaaatcctgagatccctggaaggttccgatgatgcagtcctgttacaaagacgtnggataacatgaac 



<P ttcaagtggagtgaacttcggaaaaagtctctcaacattaggtcccamggaagccagttctgaccagtggaagcgtct 
0 

ru 

P gcacctttctctgcaggaacttctggtgtggctacagctgaaagatgatgaattaagccggcaggcacctattggaggcg 

Si 

SJ 
VJ 

, actttccagcagttcagaagcagaacgatgtacatagggccticaagagggaattgaaaactaaagaacctgtaatcatg 

0 



Qg agtactcttgagactgtacgaatatttctgacagagcagcctttggaaggactagagaaactctaccaggagcccagaga 



gctgcctcctgaggagagagcccagaatgtcactcggcttctacgaaagcaggctgaggaggtcaatactgagtgggaaa 



aaUgaacctgcactccgctgactggcagagaaaaatagatgagacccttgaaagactccaggaacttcaagaggccacg 



gatgagctggacctcaagctgcgccaagctgaggtgatcaagggatcctggcagcccgtgggcgatctcctcattgactc 



tctccaagatcacctcgagaaagtcaaggcactlcgaggagaaattgcgcctctgaaagagaacgtgagccacglcaatg 
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accttgctcgccagcttaccactttgggcattcagctctcaccgtataacctcagcactctggaagacctgaacaccaga 



tggaagcttctgcaggtggccgtcgaggaccgagtcaggcagctgcatgaagcccacagggactttggtccagcatctca 



gcactttctttccacgtctgtccagggtccctgggagagagccatctcgccaaacaaagtgccctactatatcaaccacg 



agactcaaacaacttgctgggaccatcccaaaatgacagagctctaccagtctttagctgacctgaataatgtcagattc 



tcagcttataggactgccatgaaactccgaagactgcagaaggccctttgcttggatctcttgagcctgtcagctgcatg 



CP 

q tgatgccttggaccagcacaacctcaagcaaaatgaccagcccatggatatcctgcagattattaattgtttgaccacta 

ru 
q 

9 tttatgaccgcctggagcaagagcacaacaatttggtcaacgtccctctctgcgtggatatgtgtctgaactggctgctg 

SI 
>i 

* aatgmatgatacgggacgaacagggaggatccgtgtcctgtcttttaaaactggcatcatttccctgtgtaaagcaca 
5» 



W tttggaagacaagtacagataccttttcaagcaagtggcaagttcaacaggattttgtgaccagcgcaggctgggcctcc 
ttctgcatgattctatccaaauccaagacagttgggtgaagttgcatcctagggggcagtaacattgagccaagtgtc 



cggagctgcttccaatttgctaataataagccagagatcgaagcggccctcttcctagactggatgagactggaacccca 



gtccatggtgtggctgcccgtcctgcacagagtggctgctgcagaaactgcxaagcatcaggccaaa^gtaacatctgca 



aagagtgtrcaatcattggattcaggtacaggagtctaaagcactt^ 
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ggtcgagttgcaaaaggccataaaatgcactatcccatggtggaatattgcactccgactacatcaggagaagatgttcg 



S! 



a 



agactttgccaaggtactaaaaaacaaatticgaaccaaaaggtattttgcgaagcaiccccgaatgggctacctgccag 



tgcagactgtcttagagggggacaacatggaaactcccgacacaatgtagtcgagaggcctaataaagagctcagatgca 

tcgatcagagtgtgttggttttttgtgtgaGATCTAGGAACCCCTAGTGATGGAGTTGGCCACTCCCTCT 
CTGCGCGCTC 

GCTCGCTCACTGAGGCCGCCCGGGCAAAGCCCGGGCGTCGGGCGACCTTTGGTCGC 
CCaaCCTCAGTCrAfirriAGCGAGCG 



m 

jjp CCGGCCTCAGTGAGCGAGCGAGCG 

fU 
P 

§ CGCAGAGAGGGAGTGGCCAA 



si 

I 

0 



DETAILED DESCRIPTION AND PREFERRED EMBODIMENTS OF THE 

INVENTION 

To explore the feasibility of using AAV vectors for DMD gene therapy, we have devised 
strategies to create novel truncated dystrophin genes, which are small enough to be packaged 
into AAV vectors, and yet retain the essential functions needed to protect muscle from the 
pathological symptoms. 

A. Definitions 



In describing the present invention, the following terms will be employed, and are 
IU intended to be defined as indicated below, 

JO 

>g "Gene transfer" or "gene delivery" refers to methods or systems for reliably inserting a 



particular nucleotide sequence (e.g. t DNA) into targeted cells. Such methods preferably result in 
the integration of the transferred genetic material into the genome of target cells. Gene transfer 



gj provides a unique approach for the treatment of acquired and Inherited diseases, and a number of 

0 



systems have been developed in the art for gene transfer into mammalian cells. See, e.g., U.S. 
Pat. No. 5,399,346. 



By "vector" is meant any genetic element, such as a plasmid, phage, transposon, cosmid, 
chromosome, virus, virion, etc., which is capable of replication when associated with the proper 
control elements and which can transfer gene sequences between cells. Thus, the term includes 
cloning and expression vehicles, as well as viral vectors. 



By an "AAV vector" is meant a vector derived from an adeno-associated virus serotype, 
including without limitation, AAV-1, AAV-2, AAV-3, AAV-4. AAV-5, AAVX7, etc. AAV 
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vectors can have one or more of the AAV wild-type genes deleted in whole or part, preferably 
the rep and/or cap genes/but retain functional flanking ITR sequences. AAV vectors can be 
constructed using recombinant techniques that are known in the art to include one or more 
. heterologous nucleotide sequences flanked on both ends (5* and 3') with functional AAV ITRs. 

In the practice of the invention, an AAV vector can include at least one AAV ITR and a suitable 

> 

promoter sequence positioned upstream of the heterologous nucleotide sequence and at least one 
AAV ITR positioned downstream of the heterologous sequence. A "recombinant AAV vector 
plasmid" refers to one type of recombinant AAV ector wherein the vector comprises a plasmid. 
As with AAV vectors in general, 5' and 3' ITRs flank the selected heterologous nucleotide 
sequence. AAV vectors can also include transcription sequences such as polyadenylation sites, as 
Q well as selectable markers or reporter genes, enhancer sequences, and other control elements 



fU 



which allow for the induction of transcription. Such control elements are described more fully 



\j below. 

\l 
Si 

gj As used herein, the term "AAV virion" or "AAV particle" refers to a complete virus 

jjj particle. An AAV virion may be a wild type AAV virus particle (comprising a linear, single- 
^ stranded AAV nucleic acid genome associated with an AAV capsid, i.e., a protein coat), or a 
recombinant AAV virus particle (described below). In this regard, single-stranded AAV nucleic 
acid molecules (either the sense/coding strand or the antisense/anticodirig strand as those terms 

* 

are generally defined) can be packaged into an AAV virion; both the sense and the antisense 
strands are equally infectious. 



As used herein* the term "recombinant AAV virion", "recombinant AAV particle" or 
"rAAV" is defined as an infectious, replication-defective virus composed of an AAV protein 
shell encapsidating (i.e., surrounding with a protein coat) a heterologous nucleotide sequence, 
which in turn is flanked 5* and 3* by AAV ITRs. A number of techniques for constructing 
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ru 



recombinant AAV virions are known in the art. See, e.g., U.S. Pat. No. 5,173,414; International 

Publication Numbers WO 92/01070 (published 23 Jan. 1992) and WO 93/03769 (published 4 
Mar. 1993); Lebkowski et al. (1988) Molec. Cell. Biol. 8:3988-3996; Vincent et al. (1990) 
Vaccines 90 (Cold Spring Harbor Laboratory Press); Carter, BJ. (1992) Current Opinion in 
Biotechnology 3:533-539; Muzyczka, N. (1992) Current Topics in Microbiol, and Immunol. 
158:97-129; Kotin, R. M. (1994) Human Gene Therapy 5:793-801; Shelling and Smith (1994) 
Gene Therapy 1:165-169; and Zhou et al. (1994) J. Exp. Med. 179:1867-1875. 



The term "mini-dystrophin gene" is used to refer to the novel dystrophin constructs 
created by extensive deletions in the central rod domain plus extensive deletion in the C-terminal 
domain of the human dystrophin cDNA. In addition, the mini-dystrophin genes also contain 
artificial DNA sequences surrounding the original protein translation initiation codon ATG. The 



artificial sequences enhances the mini-dystrophin protein synthesis. The mini-dystrophin genes 
^ are smaller than the 5-kilobase packaging limit of AAV viral vectors. And most importantly, the 
P mini-dystrophin genes harbor biological functions that can protect the muscle from dystrophic 
jjj pathology and symptoms. 

03 
D 

0 The symbol "A" (delta) is a prefix for the mini-dystrophin genes that contain deletions as 

described above. 



B. General Methods 

Dystrophin is an enormous rod-like protein(427 kDa) localized beneath the inner surface 
of muscle ceil membrane. [Watkins, S. C. et al., Nature 333, 863-866 (1988)]. It functions 
through four major structural domains. The N -terminal domain binds to the F-actin of 
cytoskeletal structures, while die cysteine-rich (CR) domain along with the distal C-tenninal 
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domain anchors to the cell membrane via dystrophin-associated protein (DAP) complexes, thus, 
dystrophin crosslinks and stabilizes the muscle cell membrane and cytoskeleton. The central rod 
domain contains 24 triple-helix rod repeats and 4 hinges. [Koenig, M. et al, J Biol Ckem 265, 
4560-4566 (1990)]. It presumably functions as a "shock absorber" during muscle contraction. 
Interestingly, a number of mild muscular dystrophy patients, although they endure large in-frame 
deletions in the central rod domain, suffer only slight symptoms. [England, S. B. et al Nature 
343, 180-182 (1990); Passos-Bueno, M. R. et al., Hum Mol Genet 3, 919-922 (1994); and 
Mirabella, M. et al Neurology 51, 592-595 (1998)]. This phenomenon suggests that a major 
portion of the rod domain is dispensable. In addition, transgenic studies in max mice showed that 
two deletion mutants in C-terminus, one lacking exons 71-74 and the other lacking exons 75-78, 
displayed full functions in preventing dystrophic phenotypes. [Rafael, J. A. et al Journal of Cell 
Biology 134, 93-102 (1996)]. These findings suggest that the distal C-termina! region (exons 71- 
Q 74 and 75-78) may also be dispensable. In contrast, N-terminal deletions variably impan- 
el dystrophin functions. [Corrado, K. et al J Cell Biol 134, 873-884 (1996)]. We have created by 
^ rational design several mini-genes, in each deleting up to 3/4 of the central rod domain (19 rods 

M and 2 hinges) and nearly the entire distal C-terminal domain (exons 7 1 -78) {Fig, 1). These mini- 
XI 

j genes have enabled us to re-examine a previous hypothesis that a dystrophin could not be made 
9 smaller than 50% of the full length without causing muscular dystrophies. [ Fanin, M. et 
al Muscle Nerve 19, 1 154-1 160 (1996)]. These novel mini-dystrophin genes, representing only 
one third (1/3) of the 1 1 kb full-length dystrophin coding sequence, are significantly smaller than 
the 6.3 kb Becker-form mini-dystrophin [England, S. B. et al Nature 343, 180-182 (1990)] that 
was previously widely used in transgenic studies and gene therapy studies. [Ragot, T. et al 
Nature 361, 647-650 (1993); Cox, G. A. et al Nature 364, 725-729 (1993); and Wells, D. J. et 
al Hum Mol Genet 4, 1245-1250 (1995)]. To ensure sufficient physical flexibility of the protein, 
all of our mini-dystrophins still retain at least five rod repeats (Rl , R2, R22, R23 & R24) and 2 
hinges (HI & H4) in the central rod domain (Fig. 1). Construct A4 173 has an additional rod (R3), 
white A3990 has an additional hinge (H3) (Fig. 1). 



OS 



To investigate the functionality of the novel mini-dystrophin constructs, it is essential to 
demonstrate that they can protect muscle from the dystrophic phenotype. The onset of the 
phenotype in mdx mice starts at around three weeks of age with massive waves of myofiber 



degeneration/regeneration. Regenerated myofibers are characterized by the presence of central 
nuclei, while the myonuclei in normal myofibers are peripherally located. The presence of 
central nuclei is a primary pathological sign of muscular dystrophies. The absence of central 
nucleation after gene therapy would suggest that the therapy was successful. However even in 
healthy mice, a majority of the myonuclei remain centrally located after experiencing a transient 
pathology such as myotoxin treatment, [Martin, H. et al., Muscle Nerve 11, 588-596 (1988)]. 
Because adult mdx muscles already have extensive central nucleation, this makes it complex to 
evaluate the benefits of gene therapy by judging the status of myonuclei. Therefore, we chose to 
test the AAV mini-dystrophin constructs in young mdx mice (10-day old) before the onset of 
• central nucleation, so that the therapeutic effects of the mini-genes can be evaluated with 
certainty. Furthermore, equally crucial is the alleviation of other pathological signs, including 
wide variation of myofiber diameters, round (non-polygonal) myofiber shapes in transverse 
sections, proliferation of connective tissues (fibrosis), and finally loss of muscle cell membrane 
integrity. 

To investigate the therapeutic effects in mdx mice, we injected into the hindleg muscle 
(gastrocnemius) of 10-day old pups with the novel mini-dystrophin genes, which were packaged 
into AAV vectors containing an MCK (muscle-specific creatine kinase) promoter [Shield, M. A. 
et al. f Mol Cell Biol 16, 5058-5068 (1996)] to ensure muscle-specific expression. Three months 
after vector injection, the muscles were collected to evaluate mini-dystrophin expression, as well 
as for biochemical restoration of the DAP complexes, which are absence due to the primary 
deficiency of dystrophin. Irnmuno fluorescent staining on thin sections of AAV treated muscles, 
using an antibody (Dys3) specific to the human dystrophin N-tenninal region, revealed wide- 
spread vector transduction and correct submembrane location of the mini-dystrophins in a 
majority of the myofibers, especially in muscles treated with vector AAV-MCK-3849 or AAV- 
MCK-3990 (Fig. 2a & 2b and Table 1). As expected, the equivalent muscle from the age- 
matched healthy C57/B10 mice showed indistinguishable dystrophin staining pattern, when 
stained with an antibody (Dys2) that recognizes both mouse and human dystrophin C-terminal 
region. However, this antibody (Dys2) failed to stain the AAV treated mdx muscle (data not 
shown) due to deletion of this region in our mini-dystrophin genes, confirming the identity of the 

* 

vector-derived transgene products. Consistently, the untreated mdx control muscle showed no 



dystrophin staining (Fig. 2) except the very few somatic revertant myofibers recognized by Dys2 
antibody. We next examined whether the mini-dystrophins were functional in restoring the 
missing DAP complexes, including the sarcoglycan complex on the myofiber membrane which 
is not found in untreated dystrophic muscle. Immunofluorescent staining using three antibodies 
against a, P, and y sarcoglycans respectively, showed positive results in all of the consecutive 
thin sections adjacent to those stained with dystrophin antibodies (Fig. 2). These results provided 
evidence of biochemical functionality of the mini-dystrophins (lacking the entire distal C- 
terminal region) in interactions with the DAP complexes. 

Histological examination of the AAV mini-dystrophin treated muscles showed nearly 
exclusive (-99%) peripheral nucleation in the mini-dystrophin positive myofibers, as revealed by 
dystrophin immunostaining and myonuclei counterstaining with DAPI (Fig. 2 and Table 1 ). The 
mutual exclusivity between mini-dystrophin expression and central nucleation in the vector 
treated mdx muscle precisely mirrors that of the normal muscle (Table 1). In addition, the 
myofibers positive for mini-dystrophin expression also exhibited consistent myofiber sizes and 
polygonal shapes indistinguishable from those of the normal muscles (Fig. 2). By contrast, the 
untreated mdx muscle showed extensive central nucleation (Table 1), with additional signs of 
dystrophic pathology including wide variation of myofiber sizes, round myofiber shapes, and 
fibrosis (Fig.2). Hence, vector treatment eliminates dystrophic pathology and led to normal 
histology in terms of peripheral nucleation, consistent myofibre size and lack of fibrosis in the 
mini-dystrophin positive areas. These results unequivocally demonstrated the absence of muscle 
degeneration due to the therapeutic effects of the novel mini-dystrophins. 

Plasma membrane damage and leakage in dystrophic muscle is a major physiological 
defect. In order to determine weather a functional mini-dystrophin would be effective in 
improving plasma membrane integrity of the dystrophic myofibers following AAV vector 
mediated gene therapy, muscle cell membrane integrity was examined using Evans Blue dye, a 
vital red-fluorescent dye that is excluded by the normal myofibers, but is taken up by the 
dystrophic myofibers with leaky cell membrane due to contractile damages. A previous study of 
mdx mice revealed that the apoptotic myonuclei were exclusively found in Evans Blue dye 
positive myofibers, correlating membrane leakage and muscle cell apoptosis. [Matsuda, R. et al M 



J Biochem (Tokyo) 118, 959-964 (1995)]. In our experiments, the dye was administered into the 
tail vein of 1 5-week old vector-treated and untreated mdx mice as well as age-matched healthy 
mice. To induce mechanical stress to the muscle, the mice were allowed to exercise by 
continuous swimming for 20 minutes. Fifteen hours after dye administration, muscle samples 
were collected and examined for dystrophin expression along with Evans Blue dye uptake. 
Muscle from healthy mice revealed no uptake of the dye by the myofibers and uniform 
dystrophin staining across the muscle sections (Fig. 3, first row). The AAV vector treated mdx 
muscle showed results consistent with the normal muscle, thus demonstrating the mini- 
• dystrophins were sufficient to prevent plasma membrane damage (Fig. 3, second to fourth rows). 
Dye uptake (red fluorescence) was found only in those myofibers which stained negative for 
mini-dystrophin and confined in the area not transduced by the AAV vectors (Fig. 3, second to 
fourth rows). By contrast, the untreated mdx muscle revealed no presence of dystrophin but 
p extensive dye uptake (Fig. 3, last row). These results proved the physiological functionality of 
P the novel mini-dystrophins in maintaining membrane integrity and protecting myofibers from 
.0 mechanical damages by muscle contraction, 

M 

a In summary, the present invention provides that the dystrophin gene can be successfully 

§. reduced to one third (1/3) of its 1 1 kb full-length coding sequence, without compromising 



essential functions in protecting muscles from dystrophic phenotypes. Moreover, the present 
09 

q invention provides AAV vectors carrying the inini-genes are capable of mediating efficient and 
v stable correction of both biochemical and physiological defects in a major muscle group of a 
DMD animal model. Previous attempts to generate mini-genes that were shorter than 1/2 of the 
full length dystrophin failed to preserve the essential protective functions. [Yuasa, K. et al. t 
FEBS Lett 425, 329-336 (1998)]. Although the niini-genes contained both intact N- and C- 
terminal domains and 1 to 3 central rod repeats, they were functionally similar to a C-terminal 
dystrophin construct (Dp71 ), [Cox, G. A. et al, Nat Genet 8, 333-339 (1994); Greenberg, D. S. 
et al M Nat Genet 8, 340-344 (1994)], and thus sufficient to restore DAP complexes but 
insufficient to protect muscle from dystrophic pathology. However, the mini-dystrophin genes 
reported here accommodated at least 5 rod repeats (Rl , R2, R22, R23 and R24) and two binges 
(HI and H4). Therefore we hypothesized that the length of the central rod domain is the most 
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critical factor, based on the fact that a major role of dystrophin is to crosslink the myofiber 
cytoskelton and plasma membrane and stabilize the structure during muscle contraction. If the 
dystrophin is too short to span the sliding distance between the cytoskeleton and plasma 
membrane during muscle contraction, the crosslink will be disrupted and the muscle membrane 
will become unstable and prone to mechanical damages. To accommodate as many rod units in 
the central domain without exceeding the AAV vector packaging limit, we have for the first time 
deleted the entire C-terminus (819 bp) without sacrificing the primary functions of dystrophin. "' 
Our results indicate that 5 rods and 2 hinges seemed to provide sufficient length and flexibility 
for the central domain. This conclusion is supported by the observation that mini-genes A3 849 
and A3990 were equally functional in preventing the dystrophic phenotypes, although A3990 has 
an extra hinge (H3). Similarly, mini-gene A4173 has an extra rod (R3) but did not function better 
than mini-genes A3849 or A3990 (Table 1). In fact, because the entire AAV-MCK-4173 vector 
p cassette is nearly 5.2 kb in length, larger than the 5 kb packaging limit, the infectivity of its 
g viral particles was impaired leading to lower gene transfer efficiency (Fig. 3 and Table 1). 

SI 

SJ EXAMPLES 

? Example 1 : Construction of mini-dystrophin genes and AAV vector production 



Q3 

© Mini-dystrophin constructs were created mainly by PCR cloning method with Pfu polymerase 
P 

(Stratagene, CA) from human dystrophin cDNA (GenBank # NM 004006). For consistency, the 
numbering of the nucleotide only includes the 1 1,058 bp dystrophin coding sequence. Mini-gene 
A 3849 contains nucleotides 1 to 1668, 8059 to 10227 and 1 1047 to 1 1058. Mini-gene A3990 
contains nucleotides 1-1668, 7270-7410. 8059-10227 and 11047-11058. Mini-gene A4173 
contains nucleotides 1-1992 and 8059-10227 and 11047-11058. The above constructs were 

* * 

subcloned into an AAV vector plasmid containing an MCK promoter^ and a small polyA signal 
sequence (a gift from T. R. Flotte, University of Florida) to generate vector constructs AA V- 
MCK-3849, AAV-MCK-3990 and AAV-MCK-4173. AAV viral particles were produced 



according to previously published methods. [Xiao, X. et al., Journal of Virology 72, 2224-2232 
(1998)]. 



Example 2. Animal and vector administration 

All experiments involving animals were approved by the University of Pittsburgh Animal 
Care & Use Committee. The healthy mice C57/BJ0 and dystrophic mice max were purchased 
from The Jackson Laboratory (Bar Harbor, Maine). The 10-day old max pups were injected into 
the hindleg gastrocnemius muscle with 50 ul (5 x 10 10 viral particles) of AAV-MCK-3849, 
AAV-MCK-3990 or AAV-MCK-4173. Three months after vector injection, muscle samples 
were collected for examination. 

tn 
o 

fU Examples. Immunofluorescent staining 
IP 

s 

Muscle cryosections of 5 urn thickness were unmunofluorescently stained with the 

^ Mousc-on-Mouse Kit from the Vector Laboritories (Burlingame, CA) according to the 

P manufacturer's protocol, except that the cryosections were immediately treated with the blocking 

jjj buffer without the fixation step. Monoclonal antibodies against dystrophin (NCL-Dys3, N- 

terminal-specific and human-specific; NCL-Dys2, C-tenninal-specific), and antibodies against 
0 a-, and v-sarcoglycans (NCL-a-SARC, NCL-b-SARC and NCL-g-S ARC) were purchased 

from Novocastra Laboratories Ltd (Burlingame, CA). Cell nuclei were counter-stained with 

0.01% DAPI (Sigma, St Louis, MO) for 10 minutes. Photographs were taken with a Nikon TE- 

300 fluorescent microscope. 



In vivo myofiber plasma membrane integrity test: [Matsuda, R. et al.,JBiochem (Tokyo) 118, 
959-964 (1995)] Evans Blue dye(10 mg/ml PBS) was injected into the tail vein of 15-week old 
C57/B10*mice, max mice, and AAV vector-treated max mice at the dose of 0.1 rag/gram of body 
weight Following dye injection, mice were allowed continuous swimming for 20 minutes. At 1 5 
hours after Evans Blue injection, the mice were sacrificed and muscle were cryosectioned. Evans 
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ABSTRACT 

The present invention relates to compositions and methods for expressing mini- 
dystrophin peptides. In particular, the present invention provides compositions 
comprising nucleic acid sequences that are shorter than wild-type dystrophin cDNA 
and that express mini-dystrophin peptides that function in a similar manner as wild- 
type dystrophin proteins, and methods for expressing mini-dystrophin peptides in target 
cells. The present invention provides such shortened nucleic acid sequences in a 
variety of ways. For example, the present invention provides nucleic acid encoding 
only 4, 8, 12, 16, and 20 spectrin-like repeat encoding sequences (i.e. nucleic acid 
encoding an exact number of spectrin-like repeats that are multiples of 4), As wild- 
type dystrophin has 24 spectrin-like repeat encoding sequences, providing nucleic acid 
encoding fewer numbers of repeats reduces the size of the dystrophin gene (e.g. 
allowing the nucleic acid sequence to fit into vectors with limited cloning capacity). 
Another example of such shortened nucleic acid sequences are those that lack at least 
a portion of the carboxy-terminal domain of wild-type dystrophin nucleic acid. A 
further example of such shortened nucleic acid sequences are those that lack at least a 
portion of the 3' untranslated region, or 5' unstranslated region, or both. 



- 76 - 



ELbSfl7773filUS 

TRUNCATED DYSTROPHIN GENES 

This invention was made with Government support under contract NIH 
R01AR40864-10. The government has certain rights in this invention. 

FIELD OF THE INVENTION 

The present invention relates to compositions and methods for expressing mini- 
dystrophin peptides. In particular, the present invention provides compositions 
comprising nucleic acid sequences that are shorter than wild-type dystrophin cDNA 
and that express mini T dystrophin peptides that function in a similar manner as wild- 
type dystrophin proteins, and methods for expressing mini-dystrophin peptides in target 
cells. 

BACKGROUND OF THE INVENTION 

Muscular dystrophy is a group of inherited disorders characterized by 
progressive muscle weakness and loss of muscle tissue. Muscular dystrophies 
includes many inherited disorders, including Becker's muscular dystrophy and 
Duchenne's muscular dystrophy, which are both caused by mutations in the dystrophin 
gene. Both of the disorders have similar symptoms, although Becker's muscular 
dystrophy is a slower progressing form of the disease. Duchenne's muscular dystrophy 
is a rapidly progressive form of muscular dystrophy. 

Both disorders are characterized by progressive muscle weakness of the legs 
and pelvis which is associated with a loss of muscle mass (wasting). Muscle 
weakness also occurs in the arms, neck, and other areas, but not as severely as in the 
lower half of the body. Calf muscles initially enlarge (an attempt by the body to 
compensate for loss of muscle strength), the enlarged muscle tissue is eventually 
replaced by fat and connective tissue (pseudohypertrophy). Muscle contractures occur 
in the legs and heels, causing inability to use the muscles because of shortening of 
muscle fibers and fibrosis of connective tissue. Bones develop abnormally, causing 
skeletal deformities of the chest and other areas. Cardiomyopathy occurs in almost all 



cases. Mental retardation may accompany the disorder but it is not inevitable and 
does not worsen as the disorder progresses. The cause of this impairment is unknown- 
Becker's muscular dystrophy occurs in approximately 3 out of 100,000 people. 
Symptoms usually appear in men between the ages of 7 and 26. Women rarely 
5 develop symptoms. There is no known cure for Becker's muscular dystrophy. 

Treatment is aimed at control of symptoms to maximize the quality of life. Activity is 
encouraged. Inactivity (such as bedrest) can worsen the muscle disease. Physical 
therapy may be helpful to maintain muscle strength. Orthopedic appliances such as 
braces and wheelchairs may improve mobility and self-care. Becker's muscular 
10 dystrophy results in slowly progressive disability. A normal life span is possible; 
however, death usually occurs after age 40. 

Duchenne's muscular dystrophy occurs in approximately 2 out of 10,000 
q people. Symptoms usually appear in males 1 to 6 years old. Females are carriers of 

w the gene for this disorder but rarely develop symptoms. There is no known cure for 

w 

0315 Duchenne's muscular dystrophy. Treatment is aimed at control of symptoms to 
- maximize the quality of life! Activity is encouraged. Inactivity (such as bedrest) can 

& worsen the muscle disease. Physical therapy may be helpful to maintain muscle 

U strength and function. Orthopedic appliances such as braces and wheelchairs 

9 may improve mobility and the ability for self-care; Duchenne's muscular dystrophy 

ffb results in rapidly progressive disability. By age 10, braces may be required for 
□ walking, and by age 12, most patients are confined to a wheelchair. Bones develop 

abnormally, causing skeletal deformities of the chest and other areas. Muscular 
weakness and skeletal deformities contribute to frequent breathing disorders. 
Cardiomyopathy occurs in almost all cases. Intellectual impairment is common but is 
25 not inevitable and does not worsen as the disorder progresses. Death usually occurs 
by age 15, typically from respiratory (lung) disorders. 

Although there are no available treatments for muscular dystrophy, the 
usefulness of gene replacement as therapy for the disease has been established in 
transgenic mouse models. Unfortunately, progress toward therapy for human patients 
30 has been limited by lack of a suitable technique for delivery of such vectors, to large 
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masses of muscle cells. What is needed in the art is a . vector that can carry most of 
the dystrophin coding sequence, that can be cheaply produced in large quantities, that 
can be delivered to a large mass of muscle cells, and that provides stable expression of 
dystrophin after delivery. 

5 SUMMARY OF THE INVENTION 

The present invention relates to compositions and methods for expressing mini- 

dystrophin peptides. In particular, the present invention provides compositions 

comprising nucleic acid sequences that are shorter than full-length wild-type 

dystrophin cDNA and that express mini -dystrophin peptides that function in a similar 

10 manner as wild-type dystrophin proteins, and methods for expressing mini-dystrophin 

peptides in target cells. The present invention provides such shortened nucleic acid 

q sequences in a variety of ways. For example, the present invention provides nucleic 

[*j acids encoding only 4, 8, 12, 16, and 20 spectrin-like repeat encoding sequences (i.e. 

£8 nucleic acids encoding an exact number of spectrin-like repeats that are multiples of 
05 

^15 4). As wild-type dystrophin has 24 spectrin-like repeat encoding sequences, providing 

® nucleic acids encoding fewer numbers of repeats reduces the size of the dystrophin 

s 

H 1 gene (e.g. allowing the nucleic acid sequence to fit into vectors with limited cloning 

£3 

p capacity). Another example of such shortened nucleic acid sequences are those that 

2 lack at least a portion of the carboxy-terminal domain of wild-type dystrophin nucleic 

D 

E320 acid. A further example of such shortened nucleic acid sequences are those that lack 
at least a portion of the 3' untranslated region, or 5* unstran slated region, or both. 

In certain embodiments, the present invention provides compositions 
comprising nucleic acid encoding a mini-dystrophin peptide, wherein the mini- 
dystrophin peptide comprises a spectrin-like repeat domain, and wherein the spectrin-. 
25 like repeat domain consists of n spectrin-like repeats, wherein n is an even number . 

less than 24. In some embodiments, n is 20 or less. In other embodiments, n is 16 or 
less. In particular embodiments, n is 12 or less. In additional embodiments, n is 8 or 
less. In preferred embodiments, n is 4. In some embodiments, the present invention 
provides compositions comprising nucleic acid encoding a mini-dystrophin peptide, 
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wherein the mini- dystrophin peptide comprises a spectrin-like repeat domain, and 
wherein the spectrin-like repeat domain consists of n spectrin-like repeats, wherein n is 
4, 8, 12, 16, or 20, 

In certain embodiments, the mini-dystrophin peptide is capable of altering a 
5 measurable muscle value in a DMD animal model by at least approximately 10% of 
the wild type value. In other embodiments, the mini-dystrophin peptide is capable of 
altering a measurable muscle value in a DMD animal model by at least approximately 
20% of the wild type value. In particular embodiments, the mini-dystrophin peptide is 
capable of altering a measurable muscle value in a DMD animal model by at least 
10 approximately 30% of the wild type value. In preferred embodiments, the mini- 

dystrophin peptide is capable of altering a measurable muscle value in a DMD animal 
model to a level similar to the wild-type value (e.g. ±4%). In certain embodiments, 
p the nucleic acid comprises at least 2 spectrin-like repeat encoding sequences. In some 

W embodiments, the spectrin-like repeat encoding sequences are precise spectrin-like 
W 

5915 repeat encoding sequences. In certain embodiments, the nucleic acid is less than 5 

Jj; kilo-bases in length. In particular embodiments, the nucleic acid comprises, viral 

^ DNA. In preferred embodiments, the viral DNA comprises adeno-associated viral 

9 

h DNA. 

g5 In certain embodiments, the present invention provides compositions 

^0 comprising nucleic acid encoding a mini-dystrophin peptide, wherein the mini- 
O dystrophin peptide comprises a spectrin-like repeat domain, and wherein the spectrin- 

like repeat domain consists of n spectrin-like repeats, wherein n is an even number 
less than 24; and wherein the nucleic acid comprises an actin-bindihg domain 
encoding sequence, a P-dystroglycan-binding domain encoding sequence, and at least 2 
25 spectrin-like repeat encoding sequences. In some embodiments, the nucleic acid 
comprises at least 4 spectrin-like repeat encoding sequences. 

In certain embodiments, the present invention provides compositions 
comprising nucleic acid, wherein the nucleic acid comprises at least 2 spectrin-like 
repeat encoding sequences, and wherein the nucleic acid encodes a mini-dystrophin 
30 peptide comprising a spectrin-like repeat domain, wherein the spectrin-like repeat 
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domain consists of n spectrin-like repeats, and wherein n is an even number less than 
24. In some embodiments, the nucleic acid comprises at least 4 spectrin-like repeat 

encoding sequences. 

In some embodiments, the nucleic acid comprises SEQ ID NO:39 (i.e. AR4- 
5 R23). In other embodiments, the nucleic acid comprises SEQ ID NO:40 (i.e. AR2- 
R21). In certain embodiments, the nucleic acid comprises SEQ ID NO:41 (i.e. AR2- 
R21+H3). In still other embodiments, the nucleic acid comprises SEQ ID NO:42 (/.e. 
AH2-R19). 

In certain embodiments, the nucleic acid comprises an expression vector (e.g. 

10 plasmid, virus, etc). In ' some embodiments, the expression vector comprises viral 

DNA. In certain embodiments, the viral DNA comprises adeno-viral DNA. In some 

embodiments, the viral DNA comprises lentiviral DNA. In other embodiments, the 

p viral DNA comprises helper-dependent adeno-viral DNA. In'preferred embodiments, 

^ the viral DNA comprises adeno-associated viral DNA. In some embodiments, the 

Ul ■ 
Hi 5 nucleic acid is inserted in a virus (e.g. adeno-associated virus, adenovirus, helper- 

EB 

£ dependent adeno-associated virus, lenti virus). 

® In certain embodiments, the nucleic acid comprises an actin-binding domain 

h encoding sequence. In particular embodiments, the actin binding domain comprises at 

| least a portion of SEQ ID NO:6 (e.g. 5%, 10%, 20%, 40%, 50%, or 75% of SEQ ID 

^0 NO:6). In other embodiments, the actin binding domain comprises at least a portion 
p of a homolog or . mutated version of SEQ ID NO:6 (e.g. 5%, 10%, 20%, 40%, 50%, or 

75% of a SEQ ID NO:6 homolog or mutated version of SEQ ID NO:6). In certain 
embodiments, the nucleic acid comprises a p-dystroglycan binding domain. In certain 
embodiments, the p-dystroglycan binding domain comprises at least a portion of a 
25 dystrophin hinge 4 encoding sequence (e.g. the 3' 50% of SEQ ID NO:34), and at least 
a portion of dystrophin cysteine-rich domain encoding sequence (e.g. the 5' 75% of 
SEQ ID NO:35). In particular embodiments, at least a portion of hinge 4 is the WW 
domain (SEQ ED NO:45), or a homolog or mutation thereof. 



- 5 - 



10 



In particular embodiments, the spectrin-like repeat encoding sequences are 
. selected from the group consisting of SEQ ID NOS:8-10, 12-27, and 29-33. In some 
embodiments, the spectrin-like repeat encoding sequences are selected from the group 
consisting of SEQ ID NOS:8-10, 12-27, and 29-33, and homologs or mutations of 
SEQ ID NOS:8-10, 12-27, and 29-33, In preferred embodiments, the spectrin-like 
repeat encoding sequences are selected from the group consisting of SEQ ID NOS:8- 
10 and 29-33. In some embodiments, the spectrin-like repeat encoding sequences are 
identical (e.g. all the sequences are SEQ ID NO:8). In preferred embodiments, the 
spectrin-like repeat encoding sequences are all different (e.g. the nucleic acid sequence 
has only 4 spectrin-like repeat encoding sequences, and these 4 are: SEQ ID NO:8, 
SEQ ID NO:9, SEQ ID NO: 10, and SEQ ID NO:33). In certain embodiments, nucleic 
acid sequence comprises at least one spectrin-like repeat encoding sequence selected 
p from the group consisting of SEQ ID NOS:8-10, and at least one spectrin-like repeat 

N encoding sequence selected from the group consisting of SEQ ID NOS:29-33. 

$15 In certain embodiments, the nucleic acid comprises at least one dystrophin 

jjg hinge region.. In some embodiments, the nucleic acid comprises at least one 

W dystrophin hinge region selected from hinge region 1, hinge region 2, hinge region 3 

and hinge region 4. In some embodiments, the nucleic acid comprises at least one 
dystrophin hinge region selected from hinge region 1, hinge region 2, and hinge region 
0 3. In particular embodiments, dystrophin hinge region 1 is SEQ ED NO:7, or a 

B homolog (See, e.g. Fig. 1 1), or a mutant version thereof. In particular embodiments, 

dystrophin hinge region 2 is SEQ ID NO: 1 1, or a homolog (See. e.g. Fig. 1 1), or a 
mutant version thereof. In certain embodiments, dystrophin hinge region 3 is SEQ ID 
NO:28, or a homolog (See, e.g. Fig. 11), or a mutant version thereof. In other 
25 embodiments, dystrophin hinge region 4 is SEQ ID NO:34, or a homolog (See, e.g. 

Fig. 11), or a mutant" version thereof. 

In some embodiments, the nucleic acid comprises at least a portion, of wild- 
type dystrophin C-terminal protein. In other embodiments, the nucleic acid comprises 
at least a portion of the 5' untranslated region. In particular embodiments, the nucleic 
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acid comprises at least a portion of the 3* untranslated region. In different 
embodiments, the nucleic acid sequence comprises regulatory sequences (e.g. MCK 
enhancer and promoter elements). In particular embodiments, the nucleic acid 
sequence is operably linked to regulatory sequences (e.g. MCK enhancer and promoter 
elements). In certain embodiments, the nucleic acid sequence comprises a mutant 
muscle-specific enhancer region. 

In particular embodiments, the nucleic acid has less than 75% of a wild type 
dystrophin 5' untranslated region. In other embodiments, the nucleic acid has less than 
50% or 20% or 1% (e.g. 0, 1, 2 nucleotides from a wild type dystrophin 5* 
untranslated region). In particularly preferred embodiments, the nucleic acid sequence 
does not contain any of the wild-type dystrophin 5' untranslated region. In certain 
embodiments, the nucleic acid has less than 75% of a wild type dystrophin 3' 
untranslated region. In other embodiments, the nucleic acid has less than 50%, 
preferably less than 40%, more preferably less than 35% of a wild type dystrophin 3' 
untranslated region. In certain embodiments, the nucleic acid does not contain a wild- 
type dystrophin 3' untranslated region (or, in some embodiments, any type of 3' 
untranslated region). 

In particular embodiments, the mini-dystrophin peptide comprises a 
substantially deleted dystrophin C-terminal domain. In some embodiments, the mini- 
dystrophin peptide comprises less than 40% of wild type dystrophin C-terminal 
domain, preferably less than 30%, more preferably less than 20%, even more 
preferably less than 1%, and most preferably approximately 0% (e.g. 0, 1, 2, 3 or 4 
amino acids from the wild type dystrophin C-terminal domain). In some . 
embodiments, the nucleic acid sequence comprises at least one intron sequence. 

In some embodiments, the present invention provides methods for expressing a 
mini-dystrophin peptide in a target cell, comprising; a) providing; i) a vector 
comprising nucleic acid encoding a mini-dystrophin peptide, wherein the mini- 
dystrophin peptide comprises a spectrin-like repeat domain, and wherein the spectrin- 
like repeat domain consists of n spectrin-like repeats, wherein n is an even number 
less than 24, and ii) a target cell, and b) contacting the vector with the target cell 



under conditions such that the mini-dystrophin peptide is expressed in the target cells. 
In certain embodiments, the contacting comprises transfecting. In other embodiments, 
the target cell is a muscle cell. Tn particular embodiments, the target cell further 
comprises a subject (e.g. with Duchenne muscular dystrophy (DMD) or Becker 
muscular dystrophy (BMD)). In preferred embodiment, the mini-dystrophin peptide is 
expressed in the cells of a subject (e.g. such that symptoms of DMD or BMD are 
reduced or eliminated). 

In particular embodiments, the present invention provides compositions 
comprising nucleic acid, wherein the nucleic acid encodes a mini-dystrophin peptide, 
and wherein the mini-dystrophin peptide comprises a substantially deleted dystrophin 1 
C-terminal domain. In certain embodiments, the substantially deleted dystrophin C- 
terminal domain is less than 40% of a wild type dystrophin C-terminal domain. In 
other embodiments, the substantially deleted dystrophin C-terminal domain is less than 
30%, 20%, or 1% of a wild type dystrophin C-terminal domain. In preferred 
embodiments, the substantially deleted dystrophin C-terminal domain is approximately 
0% of a wild type dystrophin C-terminal domain. In certain embodiments, the mini- 
dystrophin peptide does not contain any portion of the wild type dystrophin C-tcrminal 
domain (i.e. it is completely deleted), 

In certain embodiments, the mini-dystrophin peptide is capable of altering a 
measurable muscle value in a DMD animal model by at least 10% of the wild type 
value. In other embodiments, the mini-dystrophin peptide is capable of altering a 
measurable muscle value in a DMD animal model by at least 20% of the wild type 
value. In particular embodiments, the mini-dystrophin-peptide is capable of altering a 
measurable muscle value in a DMD animal model by at (east 30% of the wild type 
value. In preferred embodiments, the mini-dystrophin peptide is capable of altering a 
measurable muscle value in a DMD animal model to a levelsimilar to the wild-type 
value (e.g. ±4%). 

In certain embodiments, the nucleic acid comprises an expression vector (e.g. 
plasmid, virus, etc). In some embodiments, the expression vector comprises viral 
DNA. In certain embodiments, the viral DNA comprises adeno-vira! DNA. In some 



embodiments, the viral DNA comprises leniviral DNA. In other embodiments, the 
viral DNA comprises helper-dependent adeno-viral DNA. In preferred embodiments, 
the viral DNA comprises adeno-associated viral DNA, In some embodiments, the 
nucleic acid is inserted in a virus (e.g. adeno-associated virus, adenovirus, helper- 
5 dependent adeno-associated virus, lentivirus). 

In certain embodiments, the nucleic acid comprises an actin-binding domain 
encoding sequence. In particular embodiments, the 'actin binding domain comprises at 
least a portion of SEQ ID NO:6 (e.g. 5%, 10%, 20%, 40%, 50%, or 75% of SEQ ID 
NO:6). In other embodiments, the actin binding domain comprises at least a portion 
10 of a homolog or mutated version of SEQ ID NO:6 (e.g. 5%, 10%, 20%, 40%, 50%, or 
75% of a SEQ ID NO:6 homolog or mutated version of SEQ ID NO:6). In certain 
embodiments, the nucleic acid comprises a p-dystroglycan binding domain. In certain 
p embodiments, the P-dystroglycan binding domain comprises at least a portion of a 

^ dystrophin hinge 4 encoding sequence (e.g. the 3' 50% of SEQ ID NO: 3 4), and at least 

C3l5 a portion of dystrophin cysteine-rich domain encoding sequence (e.g. the 5' 75% of 

® SEQ ID NO:35). In particular embodiments, at least a portion of hinge 4 is the WW 

® domain (SEQ ID NO:45), or a homolog or mutation thereof. 

H« In certain embodiments, the nucleic acid comprises at least one dystrophin 

g hinge region. In some embodiments, the nucleic acid comprises at least one 

jj^O dystrophin hinge region selected .from hinge region 1, hinge region 2, hinge region 3 
□ and hinge region 4. In some embodiments, the nucleic acid comprises at least one 

dystrophin hinge region selected from hinge region 1, hinge region 2, and hinge region 
3. In particular embodiments, dystrophin hinge region 1 is SEQ ID NO:7, or a 
homolog (See, e.g. Fig. 11), or a mutant version thereof. In particular embodiments, 
25 dystrophin hinge region 2 is SEQ ID NO: 11, or a homolog (See, e.g. Fig. 1 1), or a 

.mutant version thereof In certain embodiments, dystrophin hinge region 3 is SEQ ID 
NO:28, or a homolog {See, e.g. Fig. 11), or a mutant version thereof. In other 
embodiments, dystrophin hinge region 4 is SEQ ID NO:34, or a homolog (See, e.g. 
Fig. 1 1), or a mutant version thereof. 
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In other embodiments, the nucleic acid comprises at least a portion of the 5* 
untranslated region. In particular embodiments, the nucleic acid comprises at least a 
portion of the 3' untranslated region. In different embodiment, the nucleic acid 
sequence comprises regulatory sequences {e.g. MCK enhancer and promoter elements). 
In particular embodiments, the nucleic acid sequence is operably linked to regulatory t 
sequences (e.g. MCK enhancer and promoter elements). In certain embodiments, the 
nucleic acid sequence comprises a mutant muscle-specific enhancer region. 

In particular embodiments, the nucleic acid contains less that 75% of a wild 
type dystrophin 5' untranslated region. In other embodiments, the nucleic acid 
contains less than 50% or 20% or 1% (e.g. 0, 1, 2 nucleotides from a wild type 
dystrophin 5' untranslated region). In particularly preferred embodiments, the nucleic 
acid sequence does not contain any of the wild-type dystrophin 5' untranslated region. 
In certain embodiments, the nucleic acid has less than 75% of a wild type dystrophin 
3' untranslated region. In other embodiments, the nucleic acid has less than 50%, 
preferably less than 40%, more preferably less than 35% of a wild type dystrophin 3* 
untranslated region. In certain embodiments, the nucleic acid does not contain a wild- 
type dystrophin 3' untranslated region (or, in some embodiments, any type of 3' 
untranslated region). 

In some embodiments, the present invention provides methods for expressing a 
mini-dystrophin peptide in a target cell, comprising; a) providing; i) a vector 
comprising nucleic acid, wherein the nucleic acid encodes a mini-dystrophin peptide 
comprising a substantially deleted dystrophin C-terminal domain, and ii) a target cell, 
and b) contacting the vector with the target cell under conditions such that the mini- 
dystrophin peptide is expressed in the target cells. In certain embodiments, the 
contacting comprises transfecting. In other embodiments, the target cell is a muscle 
cell. 

DESCRIPTION OF THE FIGURES 

Figure 1 shows the nucleic acid sequence for wild-type human dystrophin 

cDNA. 
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Figure 2 shows the nucleic acid sequence for wild-type mouse dystrophin 

cDNA. 

Figure 3 shows the nucleic acid sequence for wild-type human utrophin cDNA. 
Figure 4 shows the nucleic acid sequence for wild-type mouse utrophin cDNA 
5 Figure 5 shows various domains of the nucleic acid sequence for wild-type 

human dystrophin cDNA. 

Figure 6 shows various domains of the nucleic acid sequence for wild-type 

human dystrophin cDNA. 

Figure 7 shows various domains of the nucleic acid sequence for wild-type 

10. human dystrophin cDNA. 

Figure 8 shows various domains of the nucleic acid sequence for wild-type 

human dystrophin cDNA. 
g Figure 9 shows various domains of the nucleic acid sequence for wild-type • 

W human dystrophin cDNA. 

015 ' Figure 10 shows the 3' UTR domain nucleic acid sequence for wild-type 

^ human dystrophin cDNA. 

W Figure 1 1 shows a sequence alignment between wild-type human dystrophin 

^ cDNA and wild-type mouse dystrophin cDNA. The various domains in the human 

§ dystrophin sequence have spaces between them with the ends highlighted in bold. In 

8^20 this regard, homologous sequences for various domains in the mouse cDNA sequence 

D 

n are seen. 

Figure 12 shows the nucleic acid sequence for AR4-R23, a nucleic acid 

sequence encoding a mini-dystrophin peptide. 

Figure 13 shows the nucleic acid sequence for AR2-R21, a nucleic acid 
25 sequence encoding a mini-dystrophin peptide. 

Figure 14 shows the nucleic acid sequence for AR2-R21+H3, a nucleic acid 
sequence encoding a mini-dystrophin peptide. 

Figure 15 shows the nucleic acid sequence for AH2-R19, a nucleic acid 
sequence encoding a mini-dystrophin peptide. 
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Figure 16 shows the complete cDNA sequence for human skeletal muscle alpha 

actinin. 

Figure 17 shows the nucleic acid sequence for AR9-R16, a nucleic acid 
sequence encoding a mini -dystrophin peptide. 
5 Figure 18 shows the nucleic acid sequence for the WW domain. 

Figure 19 shows various transgenic expression constructs tested in Example 1. 
Figure 20 shows the contractile properties of EDL, soleus, and diaphragm 
muscles in wild-type, mdx, and dystrophin A71-78 mice. 

Figure 21 show the nucleic acid sequence for pBSX. 
10 Figure 22 shows a restriction map for pBSX. 

Figure 23 shows the 'full-length' HDMD sequence. 
Figure 24 shows the cloning procedure for A&4-R23. 
Figure 25 shows the cloning procedure for AR2-R21+H3. 



fU Figure 26 shows the cloning procedure for AR2-R21. 

p 5 Figure 27 shows a schematic illustration of the domains encoded by the 

® truncated and full-length dystrophins sequences tested in Example 5. 

(39 Figure 28 is a graph showing the percentage of myofibers in quadricep muscles 

l± of 3 month old mice that display centrally-located nuclei in the indicated strains of 

D 

g transgenic mice. 

CTfeO Figure 29 shows graphs depicting the force generating capacity in diaphragm 

p (A) or EDL (B) muscles of the indicated strains of dystrophin transgenic mdx mice 

and control mice. 

Figure 30 shows a graph depicting the force generating capacity in EDL (A) or 
diaphragm (B) muscles of the indicated strains of dystrophin transgenic mdx mice and 

25 control mice. 

Figure 31 is a graph showing the percentage of force generating capacity lost 
after 1 or 2 lengthening contractions of the tibialis anterior muscle of the indicated 
strains of dystrophin transgenic mdx mice and control mice. 
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Figure 32 is a graph showing the total distance run on a treadmill by animals 
from the indicated strains of dystrophin transgenic mdx mice and control mice. 

Figure 33 shows a graph depicting the total body mass (A) and mass of the 
tibialis anterior muscle (B) of the indicated strains of dystrophin transgenic mdx mice 
and control mice. 

Figure 34 is a schematic illustration of the structure of a microdystrophin 
expression cassette inserted into an adeno-associated viral vector, 

Figure 35 is a schematic illustration of the structure of plasmid pTZ19R (top) 
and the sequence of the multiple cloning site in the vector (bottom). 

Figure 36 shows the nucleic acid sequence of various MCK enhancer regions 

(wild-type and mutant). 

Figure 37 shows the nucleic acid sequence of various MCK promoter regions. 
Figure 38 shows a comparison between domains in dystrophin and utrophin. 

DEFINITIONS 

To facilitate an understanding of the present invention, a number of terms and 

phrases are defined below: 

As used herein, the term "measurable muscle values" refers to measurements of 
dystrophic symptoms (e.g. fibrosis, an increased proportion of centrally located nuclei, 
reduced force generation by skeletal muscle, etc.) in an animal. These measurements 
may be taken, for example, to determine the wild-type value [i.e. the value in a control 
animal), to determine the value in a DMD (Duchenne muscular dystrophy) animal 
model (e.g. in an mdx mouse model), and to determine the value in a DMD animal 
model expressing the mini-dystrophin peptides of the present invention. Various 
assays may be employed to determine measurable muscle values in an animal 
including, but not limited to, assays measuring fibrosis, phagocytic infiltration of 
muscle tissue, variation in myofiber size, an increased proportion of myofibers with 
. centrally located nuclei, elevated serum levels of muscle pyruvate kinase, contractile 
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properties assays, DAP (dystrophin associated protein) assays, susceptibility to 
contraction induced injuries and measured force assays (See Examples 1 and 4). 

As used herein, the term "mini-dystrophin peptide" refers to a peptide that is 
smaller in size than the full-length wild-type dystrophin peptide, and that is capable of 
altering (increasing or decreasing) a measurable muscle value in a DMD animal model 
by at least approximately 10% such that the value is closer to the wild-type value (e.g. 
a mdx mouse has a measurable muscle value that is 50% of the wild-type value, and 
this value is increased to at least 60% of the wild-type value; or a mdx mouse has a 
measurable muscle value that is 150% of the wild-type value, and this value is 
decreased to at most 140% of the wild-type value). In some embodiments, the mini- 
dystrophin-peptide is capable of altering a measurable muscle -value in a DMD animal 
model by at least approximately 20% of the wild type value. In certain embodiments, 
the mini-dystrophin-peptide is capable of altering a measurable muscle value in a 
DMD animal model by at least approximately 30% of the wild type value. In 
preferred embodiments, the mini-dystrophin peptide is capable of altering a measurable 
muscle value in a DMD animal model to a level similar to the wild-type value (e.g. 
±4%). 

As used herein, the term "wild-type dystrophin cysteine-rich domain" refers to 
a peptide encoded by the nucleic acid sequences in SEQ ID NO:35 (e.g. in human), as 
well as wild type peptide homologs encoded by nucleic acid homologs of SEQ ID 

NO:35 (See, Fig. 11). 

As used herein,, the term "wild type dystrophin C-terminal domain" refers to a 
peptide encoded by the nucleic acid sequences in SEQ ID NO:36 (e.g. in human), as 
well as wild type peptide homologs encoded by nucleic acid homologs of SEQ ID 

NO:36 (See, Fig. 11).' 

As used herein, the term "mini-dystrophin peptide comprising. a substantially 
deleted dystrophin C-terminal domain" refers to a mini-dystrophin peptide that has less 
than 45% of a wild type dystrophin C-terminal domain. In some embodiments, the 
mini-dystrophin peptide comprises less than 40% of wild type dystrophin C-terminal 



domain, preferably less than 30%, more preferably less than 20%, even more 
preferably less than 1%, and most preferably approximately 0% (e.g. 0, 1, 2, 3 or 4 
amino acids from the wild type dystrophin C-terminal domain). The construction of 
mini-dystrophin peptides with a substantially deleted dystrophin C-terminal domain 
may be accomplished, for example, by deleting all or a portion of SEQ ID NO:36 
from human dystrophin SEQ ID NO:l (See, e.g. Example 3C). 

As used herein, the term "wild type dystrophin 5* untranslated region" refers to 
the nucleic acid sequence at the very 5' end of a wild type dystrophin nucleic acid 
sequence (e.g. SEQ ID NOS:l and 2) that immediately precedes the amino acid coding 
regions. For example, for human dystrophin, SEQ ID NO:5 (the first 208 bases) is the 
5* untranslated region (a homolog in mouse may be seen in Fig. 1 1). 

As used herein, the term "wild type dystrophin 3' untranslated region" refers to 
the nucleic acid sequence at the very 3* end of a wild type dystrophin nucleic acid 
sequence (e.g. SEQ ID NOS:l and 2) that immediately proceeds the amino acid 
coding regions. For example, for human dystrophin, SEQ ID NO:38 (the last 2690 
bases of the human dystrophin gene) is the 3' untranslated region (a homolog in mouse 

may be seen in Fig. 1 1). 

As used herein, the term "actin-binding domain encoding sequence" refers to 
the portion of a dystrophin nucleic sequence that encodes a peptide-domain capable of 
binding actin in vitro (e.g. SEQ ID NO:6), as well as homologs (See, Fig. 11), 
conservative mutations, and truncations of such sequences that encode peptide-domains 
that are capable of binding actin in vivo. Determining whether a particular nucleic 
acid sequence encodes a peptide-domain (e.g. homolog, mutation, or truncation of 
SEQ ID NO:6) that will bind actin in vitro may be performed, for example, by 
screening the ability of the peptide-domain to bind actin in vitro in a simple actin 
binding assay (See, Corrado et ai t FEBS Letters, 344:255-260 [1994], describing the 
expression of candidate dystrophin peptides as fusion proteins, absorbing F-actin on to 
microtiter plates, incubating the candidate peptides in the F-actin coated microtiter 
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plates, washing the plates, adding anti-fusion protein rabbit antibody, and adding an 
anti-rabbit antibody conjugated to a detectable marker). 

As used herein, the term "p-dystroglycan-binding domain encoding sequence- 
refers to the portion of a dystrophin nucleic sequence that encodes a peptide-domain 
capable of binding p-dystroglycan in vivo (e.g. SEQ ID NOs.34 and 35), as well as 
homologs {See, Fig. 11), conservative mutations, and truncations of such sequences 
that encode peptide-domains that are capable of binding p-dystroglycan in vivo. In 
preferred embodiments, the p-dystroglycan-binding domain encoding sequence 
includes at least a portion of a hinge 4 encoding region (e.g. SEQ ID NO:45, the WW 
domain) and at least a portion of a wild-type dystrophin cysteine-rich domain (e.g. at 
least a portion of SEQ ID NO:35) (See. e.g. Jung et ah, JBC, 270 (45):27305 [1995]). 
Determining whether a particular nucleic acid sequence encodes a peptide-domain (e.g. 
homolog, mutation, or truncation) that will bind p-dystroglycan in vivo may be 
performed, for example, by first screening the ability of the peptide-domain to bind p- 
dystroglycan in vitro in a simple p-dystroglycan binding assay (See, Jung et al, pg 
27306 - constructing peptide-domain dystrophin-GST fusion peptides and radioactively 
labelled p-dystroglycan, immobilizing the fusion proteins on glutathione-agarose 
beads, incubating the beads with the radioactively labelled p-dystroglycan, pelleting 
the beads, washing the beads, and resolving the sample on an SDS-polyacrylamide gel, 
staining with Coomasie blue, exposing to film, and quantifying the amount of 
radioactivity present). Nucleic acid sequences found to express peptides capable of 
binding p-dystroglycan in such assays may then, for example, be tested in vivo by 
transfecting a cell line (e.g., COS cells) with two expression vectors, one expressing 
the dystroglycan peptide and the other expressing the candidate peptide domain (as a 
fusion protein). After culturing the cells, the protein is then extracted and a co- 
immunoprecipitation is performed for one of the proteins, followed by a Western blot 
for the other. 

As used herein, the term "spectrin-like repeats" refers to peptides composed of 
approximately 100 amino acids that are responsible for the rod-like shape of many 
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structural proteins including, but not limited to, dystrophin, utrophin, fodrin, alpha- 
actin, and spectrin, when the spectrin-like repeats are present in multiple copies (e.g. 
dystrophin-24, utrophin-22, alpha-actin-4, spectrin-16, ect). Spectrin-like repeats also 
refers to mutations of these natural peptides, such as conservative changes in amino 
acid sequence, as well as the addition or deletion of up to 5 amino acids to/from the 
end of a spectrin-like repeat. Spectrin-like repeats includes 'precise spectrin-like 
repeats" (see below). Examples of spectrin-like repeats include, but are not limited to, 
peptides encoded by nucleic acid sequences found in wild-type human dystrophin (e.g. 
SEQ ID NOS:8-10, 12-27, and 29-33). 

As used herein, the term "spectrin-like repeat encoding sequences" refers to 
nucleic acid sequences encoding gpectrin-like repeat peptides. 1 This term includes 
natural and synthetic nucleic acid sequences encoding the spectrin-like repeats (e.g. 
both the naturally occurring and mutated spectrin-like repeat peptides). Examples of 
spectrin-like repeat encoding sequences include, but are not limited to, SEQ ID 

NOS:8-10, 12-27, and 29-33. 

As used herein, the term "precise spectrin-like repeat encoding sequences" 
refers to nucleic acid sequences encoding spectrin-like repeat peptides with up to 1 
additional amino acid added to, or deleted from, the spectrin-like repeat. 

As used herein, the term M spectrin-like repeat domain" refers to the region in a 
mini-dystrophin peptide that contains the spectrin-like repeats of the mini-dystrophin 
peptide. 

The term "gene" refers to a DNA sequence that comprises control and coding 
sequences necessary for the production of a polypeptide or precursor thereof. The 
polypeptide can be encoded by a full length coding sequence or by any portion of the 
coding sequence so long as the desired enzymatic activity is retained. The term 
"gene" encompasses both cDNA and genomic forms of a given gene. 

The term "wild-type" refers to a gene, gene product, or other sequence that has 
the characteristics of that gene or gene product when isolated from a naturally 
occurring source. A wild-type gene is that which is most frequently observed in a 
population and is thus arbitrarily designated the "normal" or "wild-type" form of the 
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gene. In contrast, the term "modified*' or "mutant" refers to a gene, gene product, or 
other sequence that displays modifications in sequence and or functional properties 
(e.g. altered characteristics) when compared to the wild-type gene or gene product. It 
is noted that naturally-occurring mutants can be isolated; these are identified by the 
5 . fact that they have altered characteristics when compared to the wild-type gene or 
gene product. 

The term "oligonucleotide" as used herein is defined as a molecule comprised 
of two or more deoxyribonucleotides or ribonucleotide, usually more than three (3), 
and typically more than ten (10) and up to one hundred (100) or more (although 
10 preferably between twenty and thirty). The exact size will depend on many factors, 
which in turn depends on the ultimate function or use of the oligonucleotide. The 
oligonucleotide may be generated in any manner, including chemical synthesis, DNA 
p replication, reverse transcription, or a combination thereof. 

W As used herein, the term "regulatory sequence" refers to a genetic sequence or 

UJ 

Cfll 5 element that controls some aspect of the expression of nucleic acid sequences. For 

^ example, a promoter is a regulatory element that facilitates the initiation of 

8 transcription of an operably linked coding region. Other regulatory elements are 

^ enhancers, splicing signals, polyadenylation signals, termination signals, etc. Examples 

§ include, but are not limited to, the 5' UTR of the dystrophin gene (SEQ ID NO:5), 

ffiiO MCK promotors and enhancers (both wild type and mutant, See U.S. provisional app. 
P 

□ ser no. 60/218,436, hereby incorporated by reference). 

Transcriptional control signals in eucaryotes comprise "promoter" and 
"enhancer" elements. Promoters and enhancers consist of short arrays of DNA 
sequences that interact specifically with cellular proteins involved in transcription. 
25 The present invention contemplates modified enhancer regions. 

The term "recombinant DNA vector" as used herein refers to DNA sequences 
containing a desired coding sequence and appropriate DNA sequences necessary for 
the expression of the operably linked coding sequence in a particular host organism 
(e.g., mammal). DNA sequences necessary for expression in procaryotes include a 
30 promoter, optionally an operator sequence, a ribosome binding site and possibly other 
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sequences. Eok^otic crfl." «• known « -tili» P««a«-. I-ly-"!^ 

and enhancers. . 

The terms "in operable combination", "in operable order" and "operably hnked 

as used herein refer to the linkage of nucWc acid sequences in such a manner that a 

5 nucleic acid molecule capable of directing the transcription of a given gene and/or the 

synthesis of a desired protein molecule is produced. The term also refers to the 

linkag e of amino acid sequences in such a manner so that a functional protem . 

produced. _ , 

"Hybridization" methods involve the annealing of a complementary sequence to 

l0 the target nucleic acid (the sequence to be detected). The ability of two polymers of 
nucleic acid containing complementary sequences to find each other and anneal 
through base pairing interaction is a well-recognized phenomenon. 
* The "complement" of a nucleic acid sequence as used herein refers to an 

oligonucleotide which, when aligned with the nucleic acid sequence such that the S 
0*5 end of one sequence is paired with the 3' end of the other, is in "antiparallel 
» association." Complementarity need not be perfect; stable duplexes may contam 

1 ^matched base pairs or unmatched bases. Those skilled in the art of nucleic acd 

"u technology can determine duplex stability empirically considering a number of 

5 variables including, for example, the length of the oligonucleotide, base composU.on 

§«) and sequence of the oligonucleotide, ionic strength and incidence of mismatched base 

Q 

The term "homology" refers to a degree of complementarity. There may be 
partial homology or complete homology (i.e., identity). A partially complementary 
sequence is one that at least partially inhibits a completely complementary sequence 

25 from hybridizing to a.target nucleic acid is referred to using the functional term 
"substantially homologous." The inhibition of hybridization of the completely 
complementary sequence to the target sequence may be examined using a 
hybridization assay (Southern or Northern blot, solution hybridization and the like) 
under conditions of low stringency. A substantially homologous sequence or probe 

30 will compete for and inhibit the binding [i.e., the hybridization) of a completely 
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homologous to a target under conditions of low stringency. This is not to say that 
conditions of low stringency are such that non-specific binding is permitted; low 
stringency conditions require that the binding of two sequences to one another be a 
specific (i.e., selective) interaction. The absence of non-specific binding may be tested 
5 by the use of a second target that lacks even a partial degree of complementarity (e.g., 
less than about 30% identity); in the absence of non-specific binding the probe will 
not hybridize to the second non-complementary target. 

As used herein the term "stringency" is used in reference to the conditions of 
temperature, ionic strength, and the presence of other compounds such as organic 
10 solvents, under which nucleic acid hybridizations are conducted. Those skilled in the 
art will recognize that "stringency" conditions may be altered by varying the 
parameters just described either individually or in concert. With "high stringency" 
g conditions, nucleic acid base pairing will occur only between nucleic acid fragments 

W that have a high frequency of complementary base sequences (e.g., hybridization under 

(XI5 "high stringency" conditions may occur between homologs with about 85-100% 

^ identity, preferably about 70-100% identity). With medium stringency conditions, 

03 nucleic acid base pairing will occur between nucleic acids with an intermediate 

H frequency of complementary base sequences (e.g., hybridization under "medium 

p stringency" conditions may occur between homologs with about 50-70% identity). 

Olo Thus conditions of "weak" or "low" stringency are often required with nucleic acids 
q that are derived from organisms that are genetically diverse, as the frequency of 

complementary sequences is usually less. 

Low stringency conditions when used in reference to nucleic acid hybridization 
comprise conditions equivalent to binding or hybridization at 42°C in a solution 
25 consisting of 5X SSPE (43.8 g/1 NaCU, 6.9 g/1 NaH 2 P0 4 -H 2 0 and 1,85 g/l EDTA, pH 

adjusted to 7.4 with NaOH), 0.1% SDA, 5X Denhardfs reagent [SOX Denhardfs 
contains per 500 ml: 5 g Ficoll (Type 400, Pharmacia), 5 g BSA (Fraction V, Sigma)] 
and 100 ug/ml denatured salmon sperm DNA followed by washing in solution 
comprising 5X SSPE, 0.1% SDS at 42 6 C when a probe of about 500 nucleotides in 
30 length is employed. 
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High stringency condistions when used in reference to nucleic acid 
hybridization comprises conditions equivalent to binding or hybridizing at 42°C in a 
solution consisting of 5X SSPE (43.8 g/1 NaCL, 6.9 g/1 NaH 2 P0 4 -H 2 0 and 1.85 g/1 
EDTA, pH adjusted to 7.4 with NaOH), 6.5% SDS, 5X Denhardt's Teagent and 100 
ug/ml denatured salmon sperm DNA, followed by washing in a solution comprising 
0.1X SSPE, 1.0% SDS at 42°C when a probe of about 500 nucleotides is employed. 
The art knows well that numerous equivalent conditions may be employed to 
comprise low stringency conditions; factors such as the length and nature (DNA, 
RNA, base composition) of the probe and nature of the target (DNA, RNA, base 
composition, present in solution or immobilized, etc.) and the concentration of the 
salts and other components (e.g., the presence or absence of formamide, dextran 
sulfate, polyethylene glycol) are considered and the hybridization solution may be 
varied to generate conditions of low stringency hybridization different from, but 
equivalent to, the above listed conditions. In addition, the art knows conditions that 
promote hybridization under conditions of high stringency (e.g., increasing the 
temperature of the hybridization and/or wash steps, the use of formamide in the 
hybridization solution, etc.). 

The term M trans fection" as used herein refers to the introduction of foreign 
DNA into eukaryotic cells. Transfection may be accomplished by a variety of means 
known to the art including calcium phosphate-DNA co-precipitation, DEAE-dextran- 
mediated transfection, polybrene-mediated transfection, electroporation, microinjection, 
liposome fusion, Hpofection, protoplast fusion, retroviral infection, and biolistics. 

The term "stable transfe^on" or "stably transfected" refers to the introduction 
and integration of foreign DNA into the genome of the transfected cell. The term 
"stable transfectant" refers to a cell which has stably integrated foreign DNA into the 
genomic DNA. 

As used herein, the terms "nucleic acid molecule encoding," "DNA sequence 
encoding," and "DNA encoding" refer to the order or sequence of 
deoxyribonucleotides along a strand of deoxyribonucleic acid. The order of these 
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deoxyribonucieotides determines the order of amino acids along the polypeptide 
(protein) chain. The DNA sequence thus codes for the amino acid sequence. 

As used herein, the terms "muscle ceil" refers to a cell derived from muscle 
tissue, including, but not limited to, cells derived from skeletal muscle, smooth muscle 
(e.g. from the digestive tract, urinary bladder, and blood vessels), and cardiac muscle. 
The term includes muscle cells in vitro, ex vivo, and in vivo. Thus, for example, an 
isolated cardiomyocyte would constitute a muscle cell, as would a cell as it exists in 
muscle tissue present in a subject in vivo. This term also encompasses both terminally 
differentiated and nondifferentiated muscle cells, such as myocytes, myotubes, 
myoblasts, cardiomyocytes, and cardiomyoblasts. 

As used herein, the term "muscle-specific" in reference to an regulatory 
element (e.g. enhancer region, promoter region) means that the transcriptional activity 
driven by these regions is mostly in muscle cells or tissue (e.g. 20:1) compared to the 
activity conferred by the regulatory sequences in other tissues. An assay to determine 
the muscle-specificity of a regulatory region is provided in Example 5 below 
(measuring beta-galactoside in muscle cells and liver cells from a mouse transfected 
with an expression vector). 

As used herein, the term "mutant muscle -specific enhancer region" refers to a 
wild-type muscle-specific enhancer region that has been modified (e.g. deletion, 
insertion, addition, substitution), and in particular, has been modified to contain an 
additional MCK-R control element (See U.S. Prov. App. Ser. No. 60/218,436, hereby 
incorporated by reference, and section IV below). 

DESCRIPTION OF THE INVENTION 

The present invention relates to compositions and methods for expressing minir 
dystrophin peptides. In particular, the present invention provides compositions 
comprising nucleic acid sequences that are shorter than wild-type dystrophin cDNA 
and that express mini-dy strop hin peptides that function in a similar manner as wild- 
type dystrophin proteins, and methods for expressing mini-dystrophin peptides in target 
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cells. The present invention provides such shortened nucleic acid sequences in a 
variety of ways. For example, the present invention provides nucleic acid encoding 
only 4, 8, 12, 16, and 20 spectrin-like repeat encoding sequences (i.e. nucleic acid 
encoding an exact number of spectrin-like repeats that are multiples of 4). As wild- 
type dystrophin has 24 spectrin-like repeat encoding sequences, providing nucleic acid 
encoding fewer numbers of repeats reduces the size of the dystrophin gene (e.g, 
allowing the nucleic acid sequence to fit into vectors with limited cloning capacity). 
Another example of such shortened nucleic acid sequences are those that lack at least 
a portion of the carboxy-terminal domain of wild-type dystrophin nucleic acid. A 
further example of such shortened nucleic acid sequences are those that lack at least a 
portion of the 3' untranslated region, or 5' mistranslated region, or both. 

I. Dystrophin 

A. Dystrophin Structure 

In some embodiments, the present invention provides gene constructs 
comprising spectrin-like repeats from human dystrophin. Dystrophin is a 427 kDa 
cytoskeletal protein and is a member of the spectrin/aactinin superfamily (See e.g., 
Blake et al, Brain Pathology, 6:37 [1996]; Winder, J. Muscle Res. Cell. Motil., 18:617 
[1997]; and Tinsley et al., PNAS, 91:8307 [1994]). The N-terminus of dystrophin 
binds to actin, having a higher affinity for non-muscle actin than for sarcomeric actin. 
Dystrophin is involved in the submembraneous network of non-muscle actin 
underlying the plasma membrane. Dystrophin is associated with an otigomeric, 
membrane spanning complex of proteins and glycoproteins, the dystrophin-associated 
protein complex (DPC). The N-terminus of dystrophin has been shown in vitro to 
contain a functional actin-binding domain. The C-terminus of dystrophin binds to the 
cytoplasmic tail of p-dystroglycan, and in concert with actin, anchors dystrophin to the 
sarcolemma. Also bound to the C-terminus of dystrophin are the cytoplasmic 
members of the DPC. Dystrophin thereby provides a link between the actin-based 
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cytoskeleton of the muscle fiber and the extracellular matrix. It is this link that is 

disrupted in muscular dystrophy. 

The central rod domain of dystrophin is composed of a series of 24 weakly 
repeating units of approximately 110 amino acids, similar to those found in spectrin 
(i.e., spectrin-like repeats). This domain constitutes the majority of dystrophin and 
gives dystrophin a flexible rod-like structure. The rod-domain is interrupted by four 
hinge regions that are rich in proline. It is contemplated that the rod-domain provides 
a structural link between member of the DPC. Table 1 shows an overview of the 
structural and functional domains of human dystrophin. 



Table 1 - Full Length Human Dystrophin cDNA 



Nucleotides 

1-208 



209-211 



209-964 



965-1219 



1220-1546 
1547-1879 



1880-2212 
2213-2359 



2360-2692 
2693-3019 
3020-3346 



3347-3673 
3674-4000 



Feature 

5' untranslated region 



Start codon (ATG) 



N terminus 



Hinge 1 



Spectrin-lik e repeat No. 1 
Spectrin-like repeat No. 2 



Spectrin-li ke repeat No. 3 
Hinge 2 



Spectrin-like repeat No. 4 



Spectrin-lik e repeat No. 5 
Spectrin-like repeat No. 6 



Spectrin -like repeat No. 7 
Spectrin-like repeat No. 8 



SEQ ID NO: 

SEQ ID NO:5 



SEQ ID NO:6 



SEQ ID NO:7 



SEQ IP NO:8 
SEQ ID NO:9 



SEQ ID NO: 10 



SEQ ID NO; 11 



SEQ ID NO: 12 



SEQ ID NO: 13 



SEQ ID NO: 14 



SEQ ID NO:15 
SEQ ID NO: 16 



4001-4312 



4313-4588 
4589-4915 



Spectrin-like repeat No. 9 



Spectrin-lik e repeat No. 10 
Spectrin-like repeat No. 1 1 



SEQ ID NO:17 



SEQ ID NO:18 



SEQ IDNO:19 
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D 

w 

w 
•P 5 

03 



D 
G 
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4916-5239 



5340-5551 



5552-5833 



5834-6127 
6128-6187 
6188-6514 



6515-6835 



6836-7186 



7187-7489 



7490-7612 



7613-7942 



7943-8269 



8270-8617 



8618-9004 



9005-9328 



9329-9544 
9545-10431 



10432-11254 



11255-11266 
11267-13957 



Spectrin-like repeat No. 12 



SEQ ID NO:20 



Spectrin-like repeat No. 13 



SEQ ID N0.21 



Spectrin-like repeat No. 14 



SEQ ID NO:22 



Spectrin-like re peat No. 15 
20 a mino acid insert (not hinge 
Spectrin-like repeat No. 16 



SEQ ID NO:23 



) 



SEQ ID NO:24 



Spectrin-like repeat No. 17 



SEQ ID NO: 25 



Spectrin -like repeat No. 18 
Spectrin-like repeat No. 19 
Hinge 3 



SEQ ID NO:26 
SEQ ID NO:27 
SEQ ID NO:28 



Spectrin-like repeat No. 20 



SEQ ID NO:29 



Spectrin-like repeat No. 21 



SEQ ID NO: 30 



Spectrin-like repeat No. 22 



.SEQIDNO:31 



Spectrin-like repeat No. 23 



SEQ ID NO:32 



Spectrin-li ke repeat No. 24 
Hinge 4 

Start of C terminus 



SEQ ID NO:33 
SEQ ID NO:34 



S EQ ID NO:35 
Alternatively spliced exons 71-78 I SEQ ID NO:36 



End of Coding Region 
3* untranslated region 



SEQ ID NO:37 
SEQ ID NO:38 



Domain structure based on winder et al.. tebs Letters, iW.ii-a (.1995) 
B. Spectrin-Like Repeats 

Spectrin-like repeats are about 100 amino acids long and are found in a number 
of proteins, including, the actin binding proteins spectrin, fodrin, a-actinin, and 
dystrophin, but their function remains unclear (Dherray, 1991. Biol. Cell, 71:249-254). 
These domains may be involved in connecting functional domains and/or mediate 
protein-protein interactions. The many tandem, spectrin-like motifs that comprise most 
of the mass of the proteins in this superfamily are responsible for their similar flexible, 
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rod-like molecular shapes. Although these homologous motifs are frequently called 
repeats or repetitive segments, adjacent segments in each protein are only distantly. 

related evolutionary. 

Spectrin is a cytoskeletal protein of red blood cells that is associated with the 
cytoplasmic side of the lipid bilayer (See e.g., Speicher and Ursitti, Current Biology, 
4:154 [1994]). Spectrin is a long-thin flexible rod-shaped protein that constitutes 
about 25% of the membrane-associated protein mass. Spectrin is composed of two 
large polypeptide chains, a-spectrin (-240 kDa) and p-spectrin (-220 kDa) and serves 
to cross-link short actin oligomers to form a dynamic two-dimensional submembrane 
latticework, Spectrin isoforms have been found in numerous cell types and have been 
implicated in a variety of functions. 

The recent determination of the crystal structure of a single domain of spectrin 

■ 

provides insight into the structure function of an entire class of large actin 
cross-linking proteins (Yan et al.. Science, 262:2027 [1993]). The domain is an 
example of a spectrin-like repeat. Early analysis of spectrin-like repeats by partial 
peptide sequence analysis demonstrated that most of the antiparallel spectrin 
heterodimer is made up of homologous 106 residue motifs. Subsequent sequence 
analyses of cDNAs confirmed that this small motif is the major building block for all 
spectrin isoforms, as well as for the related actinins and dystrophins (Matsudaira, 
Trends Biochem Sci, 16:87 [1991]). 

Given their similar sequences, all spectrin motifs are expected to have related, 
but not identical, three-dimensional structures. The structure of a single Drosophila 
spectrin motif, 14, which has now been determined (Yan et al., Science, 262:2027 
[1993]), should therefore provide insight into the overall conformation of spectrins in 
particular and, to a more limited extent, the other members of the spectrin superfaroily. 
The structure shows that the spectrin motif forms a three-helix bundle, similar to the 
earliest conformational prediction based on the analysis of multiple homologous motifs 
(Speicher and Marchesi, Nature, 31 1:177 [1984]). 
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II. Variants and Homo logs of Dystrophin 

The present invention is not limited to the spectrin-like repeat encoding 
sequences SEQ ID NOS:8-10, 12-27, and 29-33, but specifically includes nucleic acid 
sequences capable of hybridizing to the spectrin-like repeat encoding sequences SEQ 
ID NOS::8-10, 12-27, and 29-33, (e.g. capable of hybridizing under high stringent 
conditions). Those skilled in the art know that different hybridization stringencies 
may be desirable. For example, whereas higher stringencies may be preferred to 
reduce or eliminate non-specific binding between the spectrin-like repeat encoding 
sequences SEQ ID NOS:8-10, 12-27, and 29-33, and other nucleic acid sequences, 
lower stringencies may be preferred to detect a larger number of nucleic acicj 
sequences having different homologies to the nucleotide sequence of SEQ ID NOS:8- 

10, 12-27, and 29-33. 

Accordingly, In some embodiments, the dystrophin spectrin-like repeats of the 
compositions of the present invention (e.g., SEQ ID NOs:8-10, 12-27, and 29-33) are 
replaced with different spectrin-like repeats, including, but not limited to, variants, 
homologs, truncations, and additions of dystrophin spectrin-like repeats. Candidate 
spectrin-like repeats are screened for activity using any suitable assay, including, but 
not limited to, those described below and in illustrative Examples 1 and 5. 

A. Homologs 

1. Dystrophin From other Species 

In some embodiments, the spectrin-like repeats of the gene constructs of the 
present invention are replaced with spectrin-like repeats of dystrophin from other 
species (e.g., homologs of dystrophin), including, but not limited to, those described 
herein. Homologs of dystrophin have been identified in a variety of organisms, 
including mouse (Genbank accession number M68859); dog (Genbank accession 
number AF070485); and chicken (Genbank accession number XI 3369). The spectrin 
like repeats of the mouse dystrophin gene were compared to the human gene (See 
Figure 1 1) and were shown to have significant homology. Similar comparisons can I 
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generated with homologs from other species, including but not limited to, those 
described above, by using a variety of available computer programs {e.g., BLAST, 
from NCBI). Candidate homologs can be screened for biological activity using any 
suitable assay, including, but not limited to, those described herein. 

2. Utrophin 

In some embodiments, the spectrin-like repeats of the gene constructs of the 
present invention are replaced with spectrin-like repeats from another peptide (e.g., 
homologs of dystrophin). For example, in some embodiments, spectrin-like repeats 
from the utrophin protein (See e.g., Genbank accession number X69086; SEQ ID 
NO:3; Figure 3) are utilized. Utrophin is an autosomally-encoded homolog of 
dystrophin and has been postulated that the proteins play a similar physiological role 
(For a recent review, See e.g., Blake et ai, Brain Pathology, 6:37 [1996]). Human 
utrophin shows substantial homology to dystrophin, with the major difference, 
occurring in the rod domain, where utrophin lacks repeats 15 and 19 and two hinge 
regions (See e.g., Love et al. t Nature 339:55 [1989]; Winder et a/., FEBS Lett., 369:27 
[1995]). Utrophin thus contains 22 spectrin-like repeats and two hinge regions. A 
comparison of the rod domain of Utrophin and Dystrophin is shown in Figure 38. 

In addition, in some embodiments, spectrin-like repeats from a homolog of 
utrophin are utilized. Homologs of utrophin have been identified in a variety of 
organisms, including mouse (Genbank accession number Y12229; SEQ ID NO:4; 
Figure 4) and rat (Genbank accession number AJ002967). The nucleic acid sequence 
of these or additional homologs can be compared to the nucleic acid sequence of 
human utrophin using any suitable methods, including, but not limited to, those 
described above. Candidate spectrin-like repeats from human utrophin or utrophin 
homologs can be screened for biological activity using any suitable assay, including, 
but not limited to, those described herein. 
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3. Alpha-actinin 

In some embodiments, spectrin-like repeats from Dystrophin are replaced with 
spectrin-like repeats from alpha-actinin. The microfilament protein alpha-actinin exists 
as a dimer. The N-termina! regions of both polypeptides, arranged in antiparallel 
orientation, comprise the actin-binding regions, while the C-terminal, larger parts 
consist of four spectrin-like repeats that interact to form a rod-like structure (See e.g., 
Winkler et al, Eur. J. Biochem., 248:193 [1997]). In some embodiments, human 
alpha-actinin spectrin-like repeats are utilized (Genbank accession number M86406; 
SEQ ID NO-.87; Figure 16). In other embodiments, alpha-actinin homologs from other 
organisms are utilized (e.g., mouse (Genbank accession number AJ289242); Xenopus 
(Genbank accession number BE576799); and rat (Genbank accession number ; 
AF 190909). 

B. Variants 

Still other embodiments of the present invention provide mutant or variant 
forms of spectrin-like repeats (i.e., muteins). It is possible to modify the structure of a 
peptide having an activity of spectrin-like repeats for such purposes as enhancing 
therapeutic or prophylactic efficacy, or stability (e.g., ex vivo shelf life, and/or 
resistance to proteolytic degradation in vivo). Such modified peptides provide 
additional peptides having a desired activity of the subject spectrin-like repeats as 
defined herein. A modified peptide can be produced in which the amino acid 
sequence has been altered, such as by amino acid substitution, deletion, or addition. 

Moreover, as described above, variant forms (e.g., mutants) of the subject 
spectrin-like repeats are also contemplated as finding use in the present invention. For 
example, it is contemplated that an isolated replacement of a leucine with an 
isoleucine or valine, an aspartate with a glutamate, a threonine with a serine, or a 
similar replacement of an amino acid with a structurally related amino acid (i.e., 
conservative mutations) will not have a major effect on the biological activity of the 
resulting molecule. Accordingly, some embodiments of the present invention provide 
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variants of spectrin-like repeats containing conservative replacements. Conservative 
replacements are those that take place within a family of amino acids that are related 
in their side chains. Genetically encoded amino acids can be divided into four 
families: (1) acidic (aspartate, glutamate); (2) basic (lysine, arginine, histidine); (3) 
5 nonpolar (alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, 

tryptophan); and (4) uncharged polar (glycine, asparagine, glutamine, cysteine, serine, 
threonine, tyrosine). Phenylalanine, tryptophan, and tyrosine are sometimes classified 
jointly as aromatic amino acids. In similar fashion, the amino acid repertoire can be 
grouped as (1) acidic (aspartate, glutamate); (2) basic (lysine, arginine histidine), (3) 
10 aliphatic (glycine, alanine, valine, leucine, isoleucine, serine, threonine), with serine 

and threonine optionally be grouped separately as aliphatic-hydroxyl; (4) aromatic 
(phenylalanine, tyrosine, tryptophan); (5) amide (asparagine, glutamine); and (6) sulfur 
q -containing (cysteine and methionine) (See e.g., Stryer (ed.), Biochemistry, 2nd ed, 

W WH Freeman and Co. [1981]). Whether a change in the amino acid sequence of a 

ui 

QJ15 peptide results in a functional homolog can be readily determined by assessing the 

^ ability of the variant peptide to function in a fashion similar to the wild-type protein. 

CO Peptides in which more than one replacement has taken place can readily be tested in 

^ the same manner. 

p The present invention further contemplates a method of generating sets of 

fffeO combinatorial mutants of the present spectrin-like repeats, as well as truncation 
O 

mutants, and is especially useful for identifying potential variant sequences (i.e., 
homologs) that possess the biological activity of spectrin-like repeats (e.g., a decrease 
in muscle necrosis). In addition, screening such combinatorial libraries is used to 
generate, for example, novel spectrin-like repeat homologs that possess novel 
25 biological activities all together. 

Therefore, in some embodiments of the present invention, spectrin-like repeat 
homologs are engineered by the present method to produce homologs with enhanced 

* + * 

biological activity. In other embodiments of the present invention, 
combinatorially-derived homologs are generated which provide spectrin-like repeats 
30 that are easier to express and transfer to host cells. Such spectrin-like repeats, when 
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expressed from recombinant DNA constructs, can be used in therapeutic embodiments 
of the invention described below. 

Still other embodiments of the present invention provide spectrin-like repeat 
homologs which have intracellular half-lives dramatically different than the 
5 corresponding wild-type protein. For example, the altered proteins comprising the 
spectrin-like repeat homologs are rendered either more stable or less stable to 
proteolytic degradation or other cellular process that result in destruction of, or 
otherwise inactivate spectrin-like repeats. Such homologs, and the genes that encode 
them, can be utilized to alter the pharmaceutical activity of constructs expressing 
10 spectrin-like repeats by modulating the half-life of the protein. For instance, a short 
half-life can give rise to more transient biological effects. As above, such proteins 
find use in pharmaceutical applications of the present invention. 

2 In some embodiments of the combinatorial mutagenesis approach of the present 
□ 

flJ invention, the amino acid sequences for a population of spectrin-like repeat homologs 
W 

gJ5 are aligned, preferably to promote the highest homology possible. Such a population 

|j* of variants can include, for example, spectrin-like repeat homologs from one or more 

E3 species, or spectrin-like repeat homologs from different proteins of the same species 

\a (e.g., including, but not limited to, those described above). Amino acids that appear at 

£j each position of the aligned sequences are selected to create a degenerate set of 

d>0 combinatorial sequences. 
D 

q In a preferred embodiment of the present invention, the combinatorial spectrin- 

like repeat library is produced by way of a degenerate library of genes encoding a 
library of polypeptides that each include at least a portion of candidate spectrin-like 
repeat sequences. For example, a mixture of synthetic oligonucleotides is 
25 enzymatically ligated into gene sequences such that the degenerate set of candidate 
spectrin-like repeat sequences are expressible as individual polypeptides, or 
alternatively, as a set of larger fusion proteins (e.g., for phage display) containing the 
set of spectrin-like repeat sequences therein. 

There are many ways by which the library of potential spectrin-like repeat 
30 homologs can be generated from a degenerate oligonucleotide sequence. In some 
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member of the DPC complex {e.g., actin) is assayed. In other embodiments of the 
present invention, the gene library is cloned into the gene for a surface membrane 
protein of a bacterial cell, and the resulting fusion protein detected by panning (WO 
88/06630; Fuchs et al % BioTechnol., 9:1370 [1991]; and Goward et ai $ TIBS 18:136 
[1992]), In other embodiments of the present invention, fluorescently labeled 
molecules that bind proteins comprising spectrin like repeats (e.g., actin), can be used 
to score for potentially functional spectrin-like repeat homologs. Cells are visually 
inspected and separated under a fluorescence microscope, or, where the morphology of 
the cell permits, separated by a fluorescence-activated cell sorter. 

In an alternate embodiment of the present invention, the gene library is 
expressed as a fusion protein on the surface of a viral particle. For example, foreign 
peptide sequences are expressed on the surface of infectious phage in the filamentous 
phage system, thereby conferring two significant benefits. First, since these phage can 
be applied to affinity matrices at very high concentrations, a large number of phage 
can be screened at one time. Second,, since each infectious phage displays the 
combinatorial gene product on its surface, if a particular phage is recovered from an . 
affinity matrix in low yield, the phage can be amplified by another round of infection. 
The group of almost identical £. coli filamentous phages M13, fd, and fl are most 
often used in phage display libraries, as either of the phage gill or gVIII coat proteins 
can be used to generate fusion proteins without disrupting the ultimate packaging of 
the viral particle (See e.g., WO 90/02909; WO 92/09690; Marks et al. 7 J. Biol. Chem., 
267:16007 [1992]; Griffths et al. t EMBO J., 12:725 [1993]; Clackson et a/., Nature, 
352:624 [1991];. and Barbas et a/., Proc. Natl. Acad. Sci., 89:4457 [1992]). 

In another embodiment of the present invention, the recombinant phage 
antibody system (e.g., RPAS, Pharmacia Catalog number 27-9400-01) is modified for 
use in expressing and screening of spectrin-like repeat combinatorial libraries. The 
pCANTAB 5 phagemid of the RPAS kit contains the gene that encodes the phage gin 
coat protein. In some embodiments of the present invention, the spectrin-like repeat 
combinatorial gene library is cloned into the phagemid adjacent to the gill signal 
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seque n C e such that it is exposed as asm fusion^. In other embody 
the" present invention, the phagemid is used to transform competent * coU TG 
after .igation. In *h other embodiments of the present invent transform* ^ 
are subsequently infected with M13K07 helper phage to rescue the phagemtd and » 
candidate spectrin-like repeat gene insert. The rating recombinant phage contam 
phagemid DNA encoding a specific candidate spectrin-like repeat and display one or 
more copies of the corresponding fusion coat protein. In some embodiments of the 
present invention, the phage-displayed candidate proteins that are capable of, for 
example, binding to actin, are selected or enriched by panning. The bound phage » 
then isolated, and if the recombinant phage express at least one copy of the w,ld type 
g «I coat protein, they will retain their ability to infect E. coli. Thus, successive . 
rou „ds of reinfection of,. coU and panning will greatly enrich for spectrin-like repeat 
homology which can then be screened for further biological actmt.es. 

In light of the present disclosure, other forms of mutagenesis generally 
applicable will be apparent to those skilled in the art in addition to the aforementioned 
.tional mutagenesis based on conserved versus non-conserved residue, For example, 
spectrin-like repeat homologs can be generated and screened using, for example, 
alanine scanning mutagenesis and the like (Ruf et al, Biochem., 33:1565 [1994]; 
Wang et al, J. Biol. Chem., 269:3095 [1994]; Balint et al Gene 137:109 [1993]; 
Grodberg et al, Eur. J. Biochem., 218:597 [1993]; Nagashima et al, J. Biol. Chem. 
268 2888 [1993]; Lowman et al, Biochem., 30:10832 [1991]; and Cunningham et al, 
Science, 244:1081 [1989]), by linker scanning mutagenesis (Gustin et al, VnoL, 
193.653 [1993]; Brown ,/ */., Mol. Cell. Biol., 12:2644 [1992]; McKnight et al, 
Science, 232:316); or by saturation mutagenesis (Meyers et al, Science, 232:613 
[1986]). 

C Truncations and Additions 

In yet other embodiments of the present invention, the spectrin-like repeats of 
human dystrophin are replaced by truncation or additions of spectrin-like repeats from 
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dystrophin or another protein, including, but not limited to, those described above. 
Accordingly, in some embodiments, amino acids are truncated from either end of one 
or more of the spectrin-lilce repeats in a given construct. The activity of truncat.cn 
mutants is determined using any suitable assay, including, but not limited to, those 

disclosed herein. 

In some embodiments, additional amino acids are added to either or both ends 
of the spectnn-like repeats in a given construct. In some embodiments, single ammo 
acids are added and the activity of the construct is determined. Amino aads may be 
added to one or more of the spectrin-like repeats in a given construct. The actmty of 
spectrin-like repeats comprising additional amino acids is determined usmg any 
suitable assay, including, but not limited to, those disclosed herein. 

Ill Carboxy-Terminal Domain Truncated Dystrophin Genes 

In some embodiments, the present invention provides compositions comprising 
nucleic acid, wherein the nucleic acid encodes a mini-dystrophin peptide, and wherem 
.he mini-dystrophin peptide comprises a substantially deleted dystrophin C-termmal 
domain (e.g., 55% of the dystrophin C-terminal domain is missing). In some 
embodiments, this type of truncation prevents the mini-dystrophin peptide from 
binding both syntrophin and dystrobrevin. 

The dystrophin COOH-terminal domain is located adjacent to the cysteine-rich 
domain, and contains an alternatively spliced region and two coiled-coil motifs (Blake 
et aL. Trends Biochem. Sci., 20:133, 1995). The alternatively spliced region binds 
three isoforms of syntrophin in muscle, while the coiled-coil motifs bind numerous 
members of the dystrobrevin family (Sadoulct-Puccio et aL. PNAS, 94:12413, 1997). 
The dystrobrevins display significant homology with the COOH-terminal region of 
dystrophin, and the larger dystrobrevin isoforms also bind to the syntrophins. The 
importance and functional significance of syntrophin and dystrobrevin remains largely 
unknown, although they may be involved in cell signaling pathways (Grady « aL. Mat. 
Cell. Biol, 1:215, 1999). 
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Researchers have previously generated transgenic max mouse strains expressing 
dystrophins deleted for either the syntrophin or the dystrobrevin binding domain 
(Rafael et al, Hum. Mol Genet, 3:1725, 1994; and Rafael et al, J. Cell Biol, 134:93 
1996). These mice displayed normal muscle function and essentially normal 
localization of syntrophin, dystrobrevin, and nNOS. Thus, while dystrobrevin appears 
to protect muscle from damage (Grady ct al. Nat. Cell. Biol, 1:215, 1999), removal of 
the dystrobrevin binding site from dystrophin does not result in a dystrophy. 
Subsequent studies revealed that syntrophin and dystrobrevin bind each other in 
addition to dystrophin, so that removal of only one of the two binding sites on 
dystrophin might not sever the link between dystrophin, syntrophin and dystrobrevin. 
Surprisingly, the transgenic mice according to the present invention (See Example 1) 
displayed normal muscle function even though they lacked both the syntrophin and 
dystrobrevin binding sites. 

IV, MCK Regulatory Regions 

In certain embodiments, nucleic acid encoding mini-dystrophin peptides of the 
present invention are operably linked to muscle creatine kinase gene (MCK) regulatory 
regions and control elements, as well as mutated from of these regions and elements, 
(see U.S. Prov. App. Ser. No. 60/218,436, hereby incorporated by reference). In some 
embodiments, the nucleic acid encoding mini-dystrophin peptides is operably linked to 
these sequences to provide muscle specificity and reduced size such that the resulting 
construct is able to fit into, for example, a viral vector (e.g. adeno-associated virus). 
MCK gene regulatory regions (e.g. promoters and enhancers) display striated muscle- 
specific activity and have been characterized in vitro and in vivo. The major known 
regulatory regions in the mouse MCK gene include a 206 base pair muscle-specific 
enhancer located approximately 1.1 kb 5* of the transcription start site in mouse (i.e. 
SEQ ID NO:87) and a 358 base pair proximal promoter (i.e. SEQ ID NO:93) [Shield, 
et al, Mol Cell Biol 9 16:5058 (1996)]. A larger MCK promoter region may also be 
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employed (e.g. SEQ ID NO:92), as well as smaller MCK promoter regions (e.g. SEQ 
IDNO:94). 

The 206 base pair MCK enhancer (SEQ ID NO:87) contains a number of 
sequence motifs, including two classes of E-boxes (MCK-L and MCK-R), CarG, and 
5 AT-rich sites. Similar E-box sequences are found in the enhancers of the human, rat, 
and rabbit MCK genes [See, Trask, et al.. Nucleic Acids Res., 20:2313 (1992)]. 
Mutation may be made to this sequence by, for example, inserting an additional MCK- 
R control element into a wild-type enhancer sequence naturally containing one MCK- 
R control element (such that the resulting sequence has at least two MCK-R control 
10 elements). For example, the inserted MCK-R control element replaces the endogenous 
MCK-L control element. The 206 base pair mouse enhancer (SEQ ID NO:2) may be 
modified by replacing the left E-box (MCK-L) with a right E-Box (MCK-R) to 
2 generate a mutant muscle-specific enhancer region (e.g. to generate SEQ ID NO:88). 

W. A similar approximately 200 base pair wild type enhancer region in human may be 

S 5 : modified by replacing the left E-box with a MCK-R to generate a mutant muscle- 
j» specific enhancer region (e.g. 2R human enhancer regions). 

CO Another modification that may be made to generate mutant muscle-specific 

U enhancer regions by inserting the S5 sequence GAGCGGTTA (SEQ ID NO:95) into 

2 wild type mouse, human, and rat enhancer sequence. Making such a modification to 

5i0 the mouse enhancer SEQ ID NO:87, for example, generates S5 mutant muscle-specific 
□ enhancer regions (e.g. SEQ ID NO:89). Another modification that may be made, for 

example, to the wild type mouse enhancer is replacing the left E-box (MCK-L) with a 
right E-Box (MCK-R), and also inserting the 5S sequence, to generate 2R5S type 
sequences {e.g. in mouse, SEQ ID NO:90). These mutant muscle-specific enhancer 
25 regions may have additional sequences added to them or sequences that are taken 

away. For example, the mutant muscle-specific enhancer regions may have a portion 
of the sequence removed (e.g. the 3' 41 base pairs). Examples of such mutant 
truncation 2RS5 sequences in mouse is SEQ ID NO:91 with the 3' 41 base pairs 
removed, generating mutant truncated 2RS5 muscle-specific enhancer regions. 
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Any of these wild-type or mutant muscle-specific enhancer regions described 
above may be further modified to produce additional mutants. These additional 
mutants include, but are not limited to, muscle-specific enhancer regions having 
deletions, insertions or substitutions of different nucleotides or nucleotide analogs so 
long as the transcriptional activity of the enhancer region is maintained. Guidance in 
determining which and how many nucleotide bases may be substituted, inserted or 
deleted without abolishing the transcriptional activity may be found using computer 
programs well known in the art, for example, DNAStar software or GCQ (Univ. of 
Wisconsin) or may be determined empirically using assays provided by the present 
invention. 

V. Expression Vectors 

The present invention contemplates the use of expression vectors with the 
compositions and methods of the present invention (e.g. with the nucleic acid 
constructs encoding the mini-dystrophin peptides). Vectors suitable for use with the 
methods and compositions of the present invention, for example, should be able to 
adequately package and carry the compositions and cassettes described herein. A 
number of suitable vectors are known in the art including, but are not limited to, the 
following: 1) Adenoviral Vectors; 2) Second Generation Adenoviral Vectors; 3) Gutted 
Adenoviral Vectors; 4) Adeno-Associated Virus Vectors; and 5) Lentiviral Vectors. 

Those skilled in the art will recognize and appreciate that other vectors are 
suitable for use with methods and compositions of the present invention. Indeed, the 
present invention is not intended to be limited to the use of the recited vectors, as 
such, alternative means for .delivering the compositions of the present invention are 
contemplated. For example, in various embodiments, the compositions of the present 
invention are associated with retrovirus vectors and herpes virus vectors, plasmids, 
cosmids, artificial yeast chromosomes, mechanical, electrical, and chemical 
transfection methods, and the like. Exemplary delivery approaches are discussed 
below. 
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1. Adenoviral Vectors 

Self-propagating adenovirus (Ad) vectors have been extensively utilized to 
deliver foreign genes to a great variety of cell types in vitro and in vivo. "Self- 
propagating viruses'* are those which can be produced by transfection of a single piece 
of DNA (the recombinant viral genome) into a single packaging cell line to produce 
infectious virus; self-propagating viruses do not require the use of helper virus for 
propagation. As with many vectors, adenoviral vectors have limitations on the amount 
of heterologous nucleic acid they are capable of delivering to celts. For example, the 
capacity of adenovirus is approximately 8-10 kb, the capacity of adeno-associated 
virus is approximately 4.8 kb, and the capacity of lentivirus is approximately 8.9 kb. 
Thus, the mutants of the present invention that provide shorter nucleic acid sequences 
encoding the mini-dystrophin peptides (compared to full length wild-type dystrophin 
(14kb)), improve the carrying capacity of such vectors. 

2. Second Generation Adenoviral Vectors 

In an effort to address the viral replication problems associated with first 
generation Ad vectors, so called "second generation" Ad vectors have been developed. 
Second generation Ad vectors delete the early regions of the Ad genome (E2A, E2B, 
and E4). Highly modified second generation Ad vectors are less likely to generate 
replication-competent virus during large-scale vector preparation, and complete 
inhabitation of Ad genome replication should abolish late gene replication. Host 
immune response against late viral proteins is thus reduced [See Amalfitano et aL, 
"Production and Characterization of Improved Adenovirus Vectors With the El, E2b, 
and E3 Genes Deleted," J. Virol. 72:926-933 (1998)]. The elimination of E2A, E2B, 
and E4 genes from the Ad genome also provide increased cloning capacity. The 
deletion of two or more of these genes from the Ad genome allows for example, the 
delivery of full length or cDNA dystrophin genes via Ad vectors [Kumar-Singh et al, 
Hum. Mol Genet., 5:913 (1996)]. 
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3. Gutted Adenoviral Vectors 

"Gutted," or helper dependent, Ad vectors contain exacting DNA sequences 
that direct adenoviral replication and packaging but do not contain viral coding 
sequences [See Fisher et al. "Recombinant Adenovirus Deleted of All Viral Genes for 
Gene Therapy of Cystic Fibrosis," Virology 217:11-22 (1996) and Kochanek et al "A 
New Adenoviral Vector: Replacement of All Viral Coding Sequences With 28 kb of 
DNA Independently Expressing Both Full-length Dystrophin and Beta-galactosidase m 
Proc. Nat. Acad. Sci. USA 93:5731-5736 (1996)]. Gutted vectors are defective viruses 
produced by replication in the presence of a helper virus, which provides all of the 
necessary viral proteins in trans. Since gutted vectors do not contain any viral genes, 
expression of viral proteins is not possible. 

Recent developments have advanced the field of gutted vector production [See 
Hardy et al, "Construction of Adenovirus Vectors Through Cre-lox Recombination, " J 
Virol. 71:1842-1849 (1997) and Hartigan-O'Conner et al, "Improved Production of 
Gutted Adenovirus in Cells Expressing Adenovirus Preterminal Protein and DNA 
Polymerase," 7. Virol 73:7835-7841 (1999)]. Gutted Ad vectors are able to 
maximally accommodate up to about 37 kb of exogenous DNA, however, 28-30 kb is 
more typical. For example, a gutted Ad vector can accommodate the full length 
dystrophin or cDNA, but also expression cassettes or modulator proteins. 

4. A deno- Associated Virus Vectors 

In preferred embodiments, the nucleic acid encoding the mini-dystrophin 
peptides of the present invention are inserted in adeno-associated vectors (AAV 
vectors). AAV vectors evade a host's immune response and achieve persistent gene 
expression through avoidance of the antigenic presentation by the host's professional 
APCs such as dendritic cells. Most AAV genomes in muscle tissue are present in the 
form of large circular multimers. A A V's- are only able to carry about 5 kb of 
exogenous DNA. As such, the nucleic acid of the present invention encoding the 
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mini-dystrophin peptides is well suited, in some embodiments, for insertion into these 
vectors due the reduced size of the nucleic acid sequences. 

The dystrophin expression cassettes of the present invention (containing nucleic 
acid encoding mini-dystrophin peptides) may be cloned into any of a variety of 
cis-acting plasmid vectors that contain the adeno-associated virus inverted terminal 
repeats (ITRs) to allow production of infectious virus. For example, one such plasmid 
is the cis-acting plasmid (pCisAV) (Yan et ai, PNAS, 97:6716-6721. 2000). This 
plasmid contains the AAV-ITRs separated by a NotI cloning site. The ITR elements 
were derived from P Sub201, a recombinant plasmid from which an infectious 
adeno-associated virus genome can be excised in vitro and used to study viral 
replication. After ligation of the dystrophin expression cassette (isolated as a NotI 
fragment from P CK6DysR4-23-71-78An) into Notl-digested pCisAV. rAAV stocks are 
generated by cotransfection of pCisAV. CK6DysR4-23-71-78An and pRep/Cap 
(Fisher, et ai, J. Virol. 70:520-532, 1996) together with connection of the recombinant 
adenovirus Ad.CMVlacZ into 293 cells. Recombinant AAV vector, for example, may 
then be purified on CsCl gradients as described (Duan, et al, Virus Res. 48:41-56, 
1997). 

5. Lentiviral Vectors 

Vectors based on human or feline lentiviruses have emerged as another vector 
useful for gene therapy applications. Lentivirus-based vectors infect nondividing cells 
as part of their normal life cycles, and are produced by expression of a package-able 
vector construct in a cell line that expresses viral proteins. The small size of lentiviral 
particles constrains the amount of exogenous DNA they are able to carry to about 10 
kb. However, once again, the small size nucleic acid encoding the mini-dystrophin 
peptides of the present invention allow such vectors to be employed. 

6. Retroviruses 

Vectors based on Moloney murine leukemia viruses (MMLV) and other 
retroviruses have emerged as useful for gene therapy applications. These vectors 
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stably transduce actively dividing cells as part of their normal life cycles, and integrate 
into host cell chromosomes. Retroviruses may be employed with the compositions of 
the present invention (e.g. gene therapy), for example, in the context of infection and 
transduction of muscle precursor cells such as myoblasts, satellite cells, or other 
muscle stem cells. 

EXPERIMENTAL 

The following examples are provided in order to demonstrate and further 
illustrate certain preferred embodiments and aspects of the present invention and are 
not to be construed as limiting the scope thereof. 

In the experimental disclosure which follows, the following abbreviations 
apply: N (normal); M (molar); mM (millimolar); uM (micromolar); mol (moles); 
mmol (millimoles); nmol (micromoles); nmol (nanomoles); pmol (picomoles); g 
(grams); mg (milligrams); ug (micrograms); ng (nanograms); I or L (liters); ml 
(milliliters); ul (microliters); cm (centimeters); mm (millimeters); urn (micrometers); 
nm (nanometers); °C (degrees Centigrade); Sigma (Sigma Chemical Co., St. Louis, 
MO); and 

EXAMPLE 1 

Carboxy-Terminal Domain Truncated Dystrophin Genes 

This example describes the generation of carboxy-terminal truncated dystrophin 
nucleic acid sequences. In particular, this examples describes the construction of 
dystrophin nucleic acid sequence with the entire carboxy-terminal domain deleted, and 
testing of this sequence in a mouse model for DMD. 

A. Methods 

The bases encoding amino acids 3402-3675 (corresponding to exons 71-78) 
were deleted from the full length murine dystrophin cDNA (SEQ ID NO:2, accession 
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No. M68859) by recombinant PCR, leaving the last three amino acids (exon 79) of the 
dystrophin protein unaltered. This dystrophin A71-78 cDNA was cloned into an 
expression vector containing bases -2139 to +239 of the human -skeletal actin (HSA) 
promoter (Brennan, et al, J, Biol Chem. 268:719, 1993). A splice acceptor from the 
5 ,SV40 VF1 intron (isolated as a 400 bp Hindlll/Xbal fragment from pSVL; Amersham 
Pharmacia Biotech) was inserted immediately 3' of the HSA fragment, and the SV40 
polyadenylation signal (isolated as a BamHI fragment from pCMVfi; MacGregor arid 
Caskey, Nuc. Acid. Res., 17:2365, 1989) was inserted 3' of the dystrophin.cDNA. 
The excised dystrophin A71-78 expression cassette was injected into wild-type 
10 C57B1/10 x SJL/J F2 hybrid embryos, and F 0 mice were screened by PCR. Five 

positive F 0 's were backcrossed onto the C57Bl/10mdx background, and the line with 
the most uniform expression levels was selected for analysis; Also employed were 
□ previously described transgenic mdx mice that express dystrophin constructs deleted 
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approximately for exons 71-74 (A71-74) or exons 75-78 (A75-78), which, remove 



S}15 amino acids 3402-3511 and 3528-3675, respectively, See Rafael et al, / Cell Biol, 
£ 134:93-102, 1996). Transgenic mdx line Dp71 expresses the Dp71 isoform of 



dystrophin in striated muscle (Cox et al, Nat. Genet., 8:333-339, 1994). 



i. Morphology Methods 

Quadriceps, soleus, extensor digitorum longus (EDL), tibialis anterior, and 
□20 diaphragm muscles were removed from the mice, frozen in liquid nitrogen cooled 

O.CT. embedding medium (Tissue-Tek), and cut into 7-um sections. After fixing in 
3.7% formaldehyde, sections were stained in hematoxylin and eosin-phloxine. Stained 
sections were imaged with a Nikon E1000 microscope connected to a Spot-2 CCD 
camera. To determine the percentage of fibers containing central nuclei, the number 
25 of muscle fibers with centrally-located nuclei was divided by the total number of 

muscle fibers. 
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II. Evans Blue Assays 
4 month old control mice and 71-78 mice were analyzed after injection with 
Evans blue, as described previously (Straub el al, J. Cell. Biol., 139:375-385, 1997). 
In brief, mice were tail vein-injected with 150 ul of a solution containing 10 mg/ml 

5 Evans blue dye in PBS (150 mM NaCl, 50 mM Tris, pH 7.4). After 3 hours, the 
animals were euthanized and mouse tissues were eitheT fixed in 3.7% 
formaldehyde/0.5% glutaraldehyde to observe gross dye uptake/ or frozen unfixed in 
O.C.T. embedding medium. To examine Evans blue uptake by individual fibers, 
7-um-thick frozen sections were fixed in cold acetone and analyzed by fluorescence 

10 microscopy, 

111. Immunofluorescence Assays 
2 . Quadriceps and diaphragm muscles from C57B1/10, mdx, and A71-78 mice 

ft! were removed, frozen in O.C.T. embedding medium, and cut into 7-um sections. 

m Immunofluorescence was performed with previously described antibodies against 

?15 dystrophin (NH, terminus), al-syntrophin (SYN17), Bl-syntrophin, a-dystrobrevin-1 
K (DB670), a-dystrobrevin-2 (DB2), and utrophin. After incubation with primary 

U antibodies, cryosections were incubated with an FITC-conjugated goat anti-rabbit 

2 secondary antibody and fluorescent images were viewed on a Nikon El 000 

B» microscope. Antibodies to a-sarcoglycan (Rabbit 98), B-sarcoglycan (Goat 26), 

£ 20 y-sarcoglycan (Rabbit 245), 8-sarcoglycan (Rabbit 2 1 5), sarcospan (Rabbit 235), 

a-dystroglycan (Goat 20), B-dystroglycan (AP 83), or nNOS (Rabbit 200) have been 
described previously (Duclos et.ai, J. Cell. Biol., 142:1461, 1998). Cy3-conjugate<l 
secondary antibodies were used and images were viewed on a Bio-Rad MRC-600 laser 
scanning confocal microscope. AH digitized images were captured under the same 

4 

25 conditions. 

iv. Measurements of Contractile Properties Methods 
Contractile properties of muscles from 6-month-old A71-78 transgenic mice 
were compared with those of C57BV10 wild-type and mdx mice using methods 
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described previously (Lynch * aL. Am. J. Pkysiol, 272:C2063, 1997). The samples 
included eight muscles each from the EDL, soleus, and diaphragm. Mice were deeply 
anesthetized wvth avertin and each muscle was isolated and dissected free from the 
m0 use After removal of the limb muscles, the mice were euthanized W1 th the 
5 removal of the diaphragm muscle. The muscles were immersed in a bath filled wUh 
oxygenated buffered mammalian Ringer's solution (137 mM Nad, 24 mM NaHCO,. 
! l m M glucose, 5 mM KC1, 2 mM CaCl„ 1 mM MgS0 4 , 1 mM NaH,P0 4 , and 0.025 
mM tubocurarin chloride, pH 7.4). For each muscle, one tendon was tied to a 
servomotor and the other tendon to a force transduce, Muscles were stretched from 
10 ■ slack length to the optimal length for force development and then stimulated at a 
frequency that produced absolute isometric tetanic force (mN). After the 
measurements of the contractile properties, the muscles were removed from the bath, 
g blotted and weighed to determine musde mass. Specific force (kN/m>) was calculated 

M by dividing absolute force by total fiber cross sectional area. 

W 

CO 

ffl v Muscle Membrane Isolation Methods 

1 Muscle microsomes from 12-14 month-old C57B1/10, mdx, A71-78, A71-74, 
f* A75-78, and Dp71 mice were prepared as described previously (Ohlendieck et al, J. 

2 Cell. Biol., 112:135, 1991). In brief, skeletal muscle was homogenized in 7.5-vol 
* homogenization buffer plus protease inhibitor Complete (Boehringer). The 

homogenate was centrifuged at 14,000 g for 15 min to remove cellular debris. The 
supernatant was filtered through cheesecloth and spun at 142,000 g for 37 minutes to 
collect microsomes. The microsome pellet was resuspended in KC1 wash buffer (0.6 
M KC1, 0.3 M sucrose, 50 mM Tris-HCl, pH 7.4) plus protease inhibitors and 
recentrifuged.at 142,000 g for 37 minutes to obtain KCl-washed microsomes. The 
final pellet was resuspended in 0.3 M sucrose and 20 mM Tris-maleate, pH 7.0. 
Samples were quantified by the Coomassie Plus Protein Assay Reagent (Pierce 
Chemical Co.) and equivalent protein loading was verified by SDS-PAGE. 
KCl-washed microsomes were analyzed by Western blot using antibodies against 
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vivo, reflecting damage to the dystrophic fiber sarcolemma (Matsuda et al t J. 
Biochem. (Tokyo), 118:959, 1995). Skeletal muscle fibers from dystrophin A71-78 
mice, like wild-type animals, were found not to be permeable to Evans blue dye. 

iii. Analysis of Centrally Nucleated Muscle Fibers 

Another hallmark of dystrophy in mdx mice is the presence of large numbers of. 
centrally-nucleated muscle fibers, reflecting cycles of fiber degeneration and 
regeneration (Torres and Duchen, Brain, 110:269, 1987). To estimate the degree of 
myofiber regeneration occurring in,A71-78 transgenic mice, centrally nucleated fibers 
were counted from a variety of muscle groups in age-matched wild-type, mdx, and 
A71-78 mice {See, Table 2). By 4 months of age, 71% of muscle fibers in mdx 
quadriceps muscles contained central nuclei, whereas wild-type muscles had <1%. 
Interestingly, 4 month old dystrophin A71-78 quadriceps muscles displayed 1% 
central nuclei, indicating that very little, if any, regeneration was occurring. When 
1 -year-old mice were compared, a modest increase in centrally nucleated fibers 
became apparent. Quadriceps muscles from A71-78 mice contained 10% centrally 
nucleated fibers, although diaphragm muscles still displayed <1%. EDL and soleus 
muscles displayed 5 and 8% centrally nucleated fibers, respectively. For comparison, 
1 -year-old wild-type mice had <1% centrally nucleated fibers in both limb and 
diaphragm muscles. Furthermore, 1 -year-old mdx limb muscles had 60% centrally 
nucleated fibers, whereas the diaphragm had 35%. 
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Table 2 

Percentage of Centrally Nucleated Fibers in Mouse Skeletal Muscles 



r . ■ ■! 1 1 



C57/B110 



mdx 



A71-78 



C57/B110 12 



mdx 



A71-78 



A71-74 



A75-78 



12 



12 



15 



15 



Quad: 



<1 



71 



<1 



65 



10 



8 



Dia 



<1 



58 



<1 



<1 



35 



<1 



<1 



<1 



Quad = quadriceps; Dia = diaphragm; TA 
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ND 
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ND 



<1 



^"tibialis anterior; Age is in months 



ND 



ND 



ND 



<1 



50 



<1 



ND 



ND 



ND 



<1 



61 



8 



ND 
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Previous studies of transgenic mice expressing dystrophins deleted for exons 
A71-74 (A71-74) or exons A75-78 (A75-78) revealed no increase in the numbers of 
centrally nucleated fibers by 4 months of age (Rafael et al 1996, see above). To 
contrast these mice with the 71-78 transgenics, central nuclei counts were performed 
on 15-month-old A71-74 and 75-78 mice. It was determined that these animals had 
central nuclei counts in between those of wild-type and A71-78 mice. The A71-74 
and A75-78 mice had 5 and 8% centrally nucleated fibers in quadriceps, respectively 
(Table 2). 

iv. Contractile Properties 
Compared with muscles of wild-type mice, those from mdx mice displayed a 
significant amount of necrosis, fibrosis, and infiltrating mononuclear cells, mdx 
skeletal muscles also displayed a loss of specific force-generating capacities when 
muscles were stimulated to contract in vitro, providing an extremely sensitive and 
quantitative measurement of the dystrophic process (Fig 20 A). In contrast, dystrophin 
A71-78 mice had no major abnormalities when subjected to the same analysis (Fig 20 
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B). Muscle mass for both EDL and diaphragm were not significantly different 
between dystrophin AT 1-78 and wild-type mice, whereas dystrophin A71-78 soleus 
muscles were slightly hypertrophied. When stimulated to contract, all three muscle 
groups displayed specific forces not significantly different from wild-type (P < 0.05). 
5 These results demonstrate that the dystrophin A71-78 protein has essentially the same 
functional capacity as the full-length protein. 

v. Localization of the DAP Complex in A71-78 Mice 
Immunofluorescent analysis of the peripheral DAP complex revealed 
al-syntrophin, Bl-syntrophin, a-dystrobrevin-1, and a-dystrobrevin-2 to be localized at 
10 the sarcolemma with dystrophin, despite the lack of syntrophin and dystrobrevin 

binding sites in the transgene-encoded dystrophin, al-syntrophin levels were similar 
q between wild-type and A71-78 mice. However, the levels of Bl-syntrophin were 

•V elevated at the membrane in A71-78 mice, particularly in those fibers that normally 

UJ 

K) express significant levels of this isoform. a-dystrobrevin-1 was primarily located at 

®15 the NMJ in wild-type mice, and was exclusively located at the NMJs in mdx mice. 

& Surprisingly, in dystrophin A71-78 mice, higher levels of a-dystrobrevin-1 were 

U observed at the sarcolemma than in wild-type mice. The A71-78 mice also displayed 

2 a slight increase in utrophin localization along the sarcolemma, but this increase was 

& less than the increase in mdx fibers. Immunofluorescent localization of the 

p20 sarcoglycans, a- and B-dystroglycan, sarcospan; and nNOS in A71-78 mice revealed 

no differences in the expression of these proteins when compared with wild-type mice. 

The proper localization of these proteins to the sarcolemma indicated that membrane 

targeting of the DAP complex components can proceed in the absence of the 

COOH-terminal domain of dystrophin. 

25 vi. DAP Complex Protein Levels 

To examine the levels of the DAP complex members that associate with 
dystrophin, muscle microsomes were prepared from wild-type and dystrophin A71-78 
mice and analyzed by Western blotting. This approach provides information on the 
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relative abundance of individual DAP complex members in muscles of separate lines 
of mice. Slightly elevated levels of B-dystroglycan were detected in dystrophin 
A71-78 mice, which' we have previously observed whenever dystrophin is 
overexpressed. Iso forms of syntrophin and dystrobrevin were present at slightly 
different levels when the dystrophin A71-78 membranes were compared with those 
from wild-type mice, a 1 -syntrophin and B2-syntrophin levels weTe lower than in 
wild-type mice, whereas the level of B 1 -syntrophin was elevated. Although there was 
approximately the same amount of a-dystrobrevin-2, there were elevated levels of 
a-dystrobrevin-1 in A71-78 microsomes. A reduction in nNOS was observed in 
dystrophin A71-78 muscle, indicating that nNOS binds weakly to the DAP complex in 
A71-78 mice. Levels of a-sarcoglycan were similar in all lines tested, and provided 
an internal control for protein loading. * 

Since some DAP complex members exhibited isoform changes in A71-78 mice, 
we examined purified microsomes from dystrophin A71-74 and A75-78 mice. 
Transgenic mdx mice that express the dystrophin isoform Dp71 in muscle were also 
included in this study since these dystrophic mice have the DAP complex present at . 
the sarcolemma. a 1 -syntrophin levels were lower in all four transgenic lines 
compared with wild-type mice. Surprisingly, Bl -syntrophin was absent in A71-74 
microsomes but was highly overexpressed in A75-78 and Dp71 microsomes. The 
A71-74 microsomes had equivalent B2-syntrophin levels when compared with 
wild-type microsomes, but this isoform of syntrophin was reduced in both A75-78 and 
Dp71 microsomes. A pan syntrophin antibody, which detects all three isoforms of 
syntrophin, confirmed the upregulation of syntrophin in A75-78 and Dp71 
microsomes. Similar to A71-78, a-dystrobrevin-1 was elevated in all dystrophin 
transgenic microsome preparations. However, in comparison with wild-type, 
a-dystrobrevin-2 was higher in A71-74 and A75-78, but equal in Dp71 microsomes. 
Contrary to the A71-78 mice, deleting either exons 71-74 or 75-78 restored nNOS to 
wild-type levels. However, Dp71 mice, which lack the NH 2 -terminal and rod domains 
of dystrophin, did not retain nNOS in the microsome fractions. Previous studies have 
also shown that utrophin is upregulated in mdx and Dp71 mice (Ohlendieck et al., 
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Neuron, 7:499-508, 1991). Therefore, utrophin levels were compared in all transgenic 
lines and we found that A71-78, A71-74, and A75-78 mice do not have the elevated 
levels seen in mdx and Dp71 mice. 

EXAMPLE 2 

Construction of AR4-R23, AR2-R21+H3, and AR2-R1 

This example describes the construction of R4-R23 (micro-dysl), AR2-R21+H3 
(micro-dys3), and AR2-R1 (micro-dys2), three sequences with 4 spectrin-like repeat 
encoding sequences. The 'full-length' human dystrophin cDNA that was started with 
was actually a sequence slightly smaller than the true full-length human dystrophin 
cDNA. In particular, the starting sequence, called full-length HDMD (SEQ ID NO:47, 
see Fig. 23) is the same as the wild-type human dystrophin in SEQ ID NO:l, except 
the 3 f 1861 base pairs are deleted (at an Xbal site), and the 39 base pair alternatively 
spliced exon 71 (bases 10432-10470) are deleted. This sequence (SEQ ID NO:47) is 
originally in pBSX (SEQ ID NO:46, See Figs. 21 and 22). 

A. Cloning AR4-R23 

The procedure used for cloning AR4-R23 is outlined in Figure 24. Initially, 
three PCR reactions were performed (employing Pfu polymerase) to create the deletion 
shown in Figure 24. The primers employed in the first reaction were 5' GAA CAA 
GAT TCA CAC AAC TGG C 3' (SEQ ID NO:48), which anneals to 1954-1975 of the 
HDMD clone, and 5* GTT CCT GGA GTC TTT CAA GA T CCA CAG TAA TCT 
GCC TC 3' (SEQ ID NO:49), which is a reversed, tailed primer (the bold sequence 
anneals to 2359-2341 of the HDMD clone, and the underlined sequence anneals to 
9023-9005 the HDMD clone. PCR was conducted employing these primers, and a 425 
bp PCR product was produced. The first primer employed in the second reaction was 
5' GAG GCA GAT TAC TGT GGA T CT TGA AAG ACT CCA GGA AC 3' (SEQ 
ID NO:50), which is the reverse complement primer of SEQ ID NO:49 (the bold-faced 
sequence of SEQ ID NO:50) anneals to 2341-2359 of the HDMD clone in the forward 
direction. The underlined sequence anneals to 9005-9023 of the HDMD clone in the 



forward direction. The other primer employed for the second reaction was 5' TGT 
TTG GCG AGA TGG CTC 3' (SEQ ID NO:51) which anneals to 9413-9396 of 
HDMD in the reverse direction. PGR was conducted employing these primers, and a 
427 bp PCR product was produced. The third reaction employed the products from 
steps 1 and 2 and the outside primers SEQ ID NO:48 and SEQ ID NO:51, producing a 
814 bp fragment by PCR. This fragment was then digested with Ncol and Hindlll to 
produce a 581 bp DNA fragment. 

This 581 bp fragment was then cloned into a 5016 bp Ncol + Hind III 
fragment from the HDMD clone. The 581 bp fragment contained part of repeat 3, all 
of Hinge 2, and part of repeat 24. The Ncol site used in the HDMD clone was 
located at 2055 bp. The Hind III site was located at 9281 bp. The 5016 fragment 
contained the pBSX cloning vector sequence, and the entire 5* UTR, the entire N 
terminus, Hinge 1, Repeats 1, 2, and part of repeat 3 up to the Ncol site of human 
dystrophin. Ligation of the 5016 bp fragment and 581 bp fragment (step 2) was then 
performed to created a 5597 bp sequence. 

Step 3 was then performed to clone a 2.9 kb Hindlll fragment containing part 
of repeat 24, the C terminus, and the 3' UTR (See Fig. 24). The 5' Hindlll site is 
located at 9281 bp of the HDMD clone. The 3' Hindlll site of this fragment is 
derived from pBSX poly linker. This 2.9 kb fragment was cloned into the Hindlll site 
of the product of Step 2 to yield an 8.5 kb plasmid, composed of the AR4-R23 cDNA 
plus pBSX. The entire AR4-R23 cDNA was excised from pBSX with NotI and cloned 
into the NotI site of the HSA expression vector (HSA promoter - VP1 intron - NotI 
site * tandem SV40 poly adenylation site). 

* * 

B. Cloning AR2-R21+H3 

The procedure used for cloning AR2-R21+H3 is outlined in Figure 25. 
Initially, four PCR reactions were performed (employing Pfu polymerase) to create the 
deletion shown in Figure 25. The primers employed in the first reaction were 5' GAT 
GTG GAA GTG GTG AAA GAC 3 (SEQ ID NO:52), which anneals to 1319-1330 of 
the HDMD clone, and 5' CCA ATA GTG GTC AGT CCA GGA GCA TGT AAA 
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TTG CTT TG 3* (SEQ ID NO:53), which is a reverse, tailed primer (the bold-faced 
sequence anneals to 1546-1532 of the HDMD clone and the underlined sequence 
anneals to 7512-7490 of the HDMD clone, PCR was conducted with these primers 
and a 228 bp PCR product was produced. The first primer employed in the second 
reaction was 5* CAA AGC AAT TTA CAT QCT CCT GGA CTG ACC ACT ATT 
GG 3' (SEQ ID NO:54) > which is the reverse complement of SEQ ID NO:53 (the 
bold-faced sequence of SEQ ID NO;54 anneals to 1532-1546 of the HDMD. clone in 
the forward direction, and the underlined sequence anneals to 7512-7490 of the 
HDMD clone in the forward direction. The other primer employed in the second 
reaction was 5' CTG TTG C AG TAA TCT ATG CTC CAA CAT CAA GGA AGA 
TG 3' (SEQ ID NO:55), and the bold-faced sequence anneals to 8287-8270 of the 
HDMD clone, and the underlined sequence anneals to 7612-7593 of the HDMD clone 
as a reverse primer. PCR was performed with these primers, and a 123 bp PCR 
product was produced. The first primer employed in the third reaction was 5' CAT 
CTT CCT TGA TGT TGG AG C ATA GAT TAC TGC AAC AG 3* (SEQ ID 
NO:56), the underlined sequence anneals to 7593-7612 of the HDMD clone in the 
forward direction, and the bold-faced sequence anneals to 8270-8287. The second 
primer employed in the third reaction was SEQ ID NO: 51 (see above), which anneals 
to 9413-9396 in the reverse direction. PCR was performed with these primers, and a 
1143 bp fragment was produced. The fourth reaction employed the products from 
reactions 1,2, and 3 as template, and the outside primers (SEQ ID NO:52 and SEQ ID 
NO:51), and a 1494 bp fragment was produced using Pfu polymerase. 

This 1494 hp fragment was then digested with Muni and Hindlll to produce a 
1270 bp band and cloned into a 4320 bp Muni + Hindlll fragment from the HDMD 
clone. The 1270 bp fragment contained the part of repeat 1, all of hinge 3, repeat 22, 
repeat 23, and part of repeat 24. The 4320 bp fragment contained the 5' UTR of 
HDMD, the N terminus, Hinge 1, and part of repeat 1 and pBSX. The Muni site in 
HDMD is located at base 1409. The Hindlll site is at 9281 bp. Ligation of the 4320 
bp fragment and' the 1270 bp fragment was then performed (See Figure 25) and a 
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4490 bp fragment was produced. Step 3 was performed as describe above for AR4- 
R23 to generate AR2-R21+H3. 

C. Cloning AR2-R21 

The cloning of AR2-R21 was performed essentially the same way as for 
AR2-R21+H3, with the exception of the recombinant PCR reaction to assemble the rod 
domain deletion (See, Figure 26). All other steps are the same. Three PCR reactions 
were performed (using Pfu polymerase) ,to create the deletion. The primers employed 
in the first reaction were SEQ ID NO:52 (see above), and 5' CTO TTG CAG TAA 
TCT ATG ATG TAA ATT GCT TTG 3' (SEQ ID NO:57) f the underlined sequence 
anneals to 8287-8270 of the HDMD clone in the reverse direction, and the bold-faced 

L 

sequence anneals to 1546-1532 of the HDMD clone in the reverse direction. PCR was 
performed with these primers, and a 250 bp product was obtained. The first primer 
employed in the second reaction was 5* CAA AGC AAT TTA CAT CAT AGA TTA 
CTG CAA CAG 3* (SEQ ID NO:58), which is is the reverse complement of SEQ ID 
NO:57 (the bold-faced sequence of SEQ ID NO:58 anneals to 1532-1546 of the 
HDMD clone in the forward direction, and the underlined sequence anneals to 
8270-8287 of the HDMD clone in the forward direction. The other primer employed 
in the second reaction was SEQ ID. NO:51, which anneals to 9413-9396 in the reverse 
direction. PCR was performed with these primers and a 1143 bp product was 
obtained. The third reaction employed the products from reactions 1 and 2 (as 
template) and the outside primers (SEQ ID NO:52 and SEQ ID NO:51), and a 1383 
bp fragment was produced. This fragment was then digested with Muni and Hindlll 
to produce an 1 147 bp fragment containing part of repeat 1, repeat 22, repeat 23, and 
part of repeat 24. This was then cloned into the same Muni + HindlH HDMD 
fragment described for the AR2-R21+H3 clone and all other steps thereafter were the 
same. 
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EXAMPLE 3 
AR4-R23 Deletions 
This example describes the construction of 5' UTR, 3' UTR, and C-terminal 
deletions of AR4-R23 (making it even smaller), as well as the addition of 
polyadenylation and promoter sequences. This example also describes the alteration of 
the Kozak sequence (to become more like that of consensus). 

A. Deletion of the 3' UTR 

In order to delete the 3' UTR, the following two primers were employed 5' 
TCT CTC CAA GAT CAC CTC G 3* (SEQ ID NO:64), which anneals to 9117-9134 
of the HDMD full length clone, and 5' ATG AAG CTT GCG GCC GCA TGC GGG 
AAT CAG GAG TTG 3' (SEQ ID NO:65) (the underlined site is a Hindlll site that 
was included in this primer and the bold-faced type is a NotI site). SEQ ID NO:65 is 
a reverse primer that anneals to 1 1340-1 1322 of HDMD in the 3' UTR. These primers 
cause the deletion of 707 bp of the 3* UTR from the Xbal cloning site located at 
12057 to the end of this primer (SEQ ID NO:65), leaving 113 bp of native 3' UTR, 
and introducing NotI and Hindlll cloning sites. The PCR product obtained using the 
primers corresponding to SEQ ID NOS:64 and 65 on the pAR4-R23 clone was named 
HdysA3'UTR and was saved for use as a template to generate a further deletion of 
exons 71-78 (see part C below). 

B. Deletion of 5 f UTR and Alteration of Kozak Sequence 
A portion of the 5* UTR was deleted (and the Kozak sequence was altered in 
the same step). The 'step 2* clone from cloning of AR4-R23 was utilized (this was the 
the product of ligating the step 1 PCR product into the 5016 bp Ncol and Hindlll 
fragment from the HDMD full-length clone, and this clone contained pBSX backbone 
plus the 5' UTR, N terminus, Hinge 1, Repeats 1, 2, 3, Hinge 2, and part of repeat 24. 
There is an Muni site located in the first repeat at nucleotide 1409 of the HDMD 
cDN A. In addition, there is a NotI site that is polylinker derived at the 5' end of the 
clone. These two sites were employed, Muni + NotI, to clone a new fragment 
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containing a truncated 5' UTR and an altered Kozak sequence as follows. PCR was 
performed, using Pfu polymerase using the following primers. The first primer was 5' 
TA G CGG CCG C GG TIT TTT TTA TCG CTG CCT TGA TAT ACA CTT TCC 
ACC ATG CTT TGG TGG GAA GAA GTA G 3' (SEQ ID NO:59). We created a 
NotI site (underlined) in this primer so the product could be cloned back into the NotI 
site from the polylinker. The sequence immediately 3' to this NotI site corresponds to 
the dystrophin 5' UTR sequence (the original Kozak sequence was changed with this 
primer, from TCAAAATGC, changed to CCACCATGC. The second primer was 5' 
TTT TCC TGT TCC AAT CAG C 3' (SEQ ID NO:60) which anneals to sequence 
1441-1423 of HDMD full length clone. The final product of this reaction was 1270 
bp and was digested with Notl+MunI to produce a 1233 bp fragment that was then 
cloned into the NotI (polylinker) + Muni site in Repeat 1 of the "Step 2" clones 
(described above for AR4-23). This new clone was named pHDMD5' Kozak. 

C. Deletion of exons 71-78 (C-terminal) 

Using three PCR reactions, a 71-78 deletion was created. We used the Hindlll 
fragment containing the 3 'UTR that was generated by digestion of the HDMD fiill- 
length dystrophin cDNA with Hindlll as the vector to clone the 71-78 fragment into 
the Hindlll site. The primer employed for the first reaction were 5' GGC TTC CTA 
CAT TGT GTC AGT TTC CAT GTT GTC CCC 3* (SEQ ID NO:66), and 5' TCT 
CTC CAA GAT CAC CTC 3' (SEQ CD NO:67) anneals to 9117-9134 of HDMD, 
PCR was performed employing these primers and a 1334 bp product was produced. 
The primers for the second reaction were SEQ ID NO:65, and 5 1 GGG GAC AAC 
ATG GAA ACT GAC ACA ATG TAG OAA GCC 3' (SEQ ID NO:68), where the 
bold-face sequence anneals to exon 70 at 10415-10431 in the forward direction, and 
the underlined sequence anneals to 11216-11233 in the forward direction. PCR was 
performed and a 150 bp fragment was generated. The product of reactions 1 and 2 
were used as template and the outside primers SEQ ID NO:65 and SEQ ID NO:67 
were used to prime the reaction which generated the complete 71-78 C terminus (1484 
bp). This product was digested with Hindlll to produce a 1319 bp fragment and was 
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cloned into the Hindlll site of pTZ19R [See Figure 35). This new clone was named 
pTZ-HDMD-H3A71-78A3. 

D. Cloning of the SV40 pA Sequence into the NOt I site 

The next step was the cloning of the SV 40 pA sequence: 
5'GATCCAGACATGATAAGATACATTGATGAGTTTGGACAAACCACAACTAGA 

ATGCAGTGAAAAAAATGCTTTATTTGTGAAATTTGTGATGCTATTGCTTTATT 
TGTAACCATTATAAGCTGCAATAAACAAGTTAACAACAACAATTGCATTCATT 
TTATGTTTCAGGTTCAGGGGGAGGTGTGGGAGGTTTTTTCGGATC3' (SEQ ID 
NO:71) into the Notl site of the 3' Hindlll fragment that now contains the 3* UTR and 
71-78. A PCR reaction was performed on the template pHSA with a reverse primer 5* 
A GC GGC CGC AAA AAA CCT CCC ACA CCT CC 3' (SEQ ID NO:69, containing 
a regenerating Notl site - underlined) and 5* TAC GGC CGA TCC AGA CAT GAT 
AAG ATA C 3' (SEQ ID NQ:70, containing a destroying EagI site, in bold). All 
other sequence (besides the Notl and EagI sites) is SV40 pA. This PCR reaction 
generated a 195 bp product + cloning sites = 209 bp. We then cloned this fragment 
into the Notl site of pTZ-HDMD-H3A71-78A3 generated by PCR in the 3' UTR clone. 
The upstream (5' - most) Notl site in this clone was destroyed by EagI ligation. This 
new clone was named pTZ-HDMD-H33'A. 

£. Cloning of CK6 promoter into Notl site 

The CK6 promoter - 5' GGT- 
ACTACGGGTCTAGGCTGCCCATGTAAGGAGGCAAGGCCTGGGGACACCCGAG 
ATGCCTGGTTATAATTAACCCCAACACCTGCTGCCCCCCCCCCCCCAACACCT 
GCTGCCTGAGCCTGAGCGGTTACCCCACCCCGGTGCCTGGGTCTTAGGCTCTG 
TACACCATGGAGGAGAAGCTCGCTCTAAAAATAACCCTGTCCCTGGTGGGCC 
CAATCAAGGCTGTGGGGGACTGAGGGCAGGCTGTAACAGGCTTGGGGGCCA 
GGGCTTATACGTGCCTGGGACTCCCAAAGTATTACTGTTCCATGTTCCCGGCG 
AAGGGCCAGCTGTCCCCCGCCAGCTAGACTCAGCACTTAGTTTAGGAACCAG 
TGAGCAAGTCAGCCCTTGGGGCAGCCCATACAAGGCCATGGGGCTGGGCAAG 
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CTGCACGCCTGGGTCCGGGGTGGGCACGGTGCCCGGGCAACGAGCTGAAAGC 
TCATCTGCTCTCAGGGCCCCTCCCTGGGGACAGCCCCTCCTGGCTAGTCACAC 
CCTGTAGGCTCCTCTATATAACCCAGGGGCACAGGGGCTGCCCCCGGGTCAC 

GGGGATCCTCTAGACC-3' (SEQ ID N0:61) was amplified using two tailed 
5 primers: 5' * r,r. one. CGC GOT ACT ACQ GOT CTA GG 3' Forward (SEQ ID 

NO:62), and 5" ATC GGC CGT CTA GAG GAT CCC CGT GAC C 3' Reverse (SEQ 
ID NO:63). The underlined sequence is a NotI site added to the end of this primer. 
The remaining sequence is CK6 sequence. The bold-faced type is an EagI site added 
to the end of this primer. The remaining sequence is from CK6. The CK6 promoter 
,0 - was amplified this way so we could add the NotI and EagI sites (so the entire cassette 
could be excised when put back together with NotI). This PGR product was therefore 
digested whh NotI and EagI and ligated into the NotI site of pHDMD5"Kozak. This 
g new clone was named pCK6HDMD5'Kozak. NotI and EagI produce compatible 

cohesive sites, but when EagI ligates to NotI, it destroys the site. So we placed the 
EagI site at the 3' end, so that when the final construct was cut with NotI, the entire 
| expression cassette could be excised intact. The same strategy was employed at the 3' 

end when placing the SV40 poly A sequence into the 3' Not I site. 
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Q F. Re-ligattng the 5' and 3' ends. 

This step was performed as described above in the micro-dystrophin transgene 
§20 constructs. We reconstituted the same cloning sites but with modifications in the 
fragments, so the modified 3' end, isolated as a Hindlll fragment from clone 
P TZ-HDMD-H33'A (example 3 part D), was able to be cloned into the Hindlll site of 
P CK6HDMD5'Kozak (example 3, part E). This final clone, named 
pCK6R4-R23KozakA3'. contains a truncated dystrophin expression cassette that can be 
25 excised in its entirety by digestion with NotI. This excised expression cassette can 

then be used for a variety of purposes. One such purpose is to clone the cassette into 
a plasmid containing the inverted terminal repeats from adeno-associated virus. By 
cloning the dystrophin expression cassette HDMD-H33'A into a cloning site between 
the two ITRs of AAV, a recombinant AAV vector could be produced. 
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Example 4 

Construction of Reduced Repeat Dystrophin Constructs 

This example describes the construction of AH2-R19 (an 8 spectrin-like-repeat 
sequence), pAR9R16 (a 16 spectrin-like-repeat sequence), pAR!R24 (a zero spectrin- 
like-repeat sequence), pAH2-H3 (a 8 spectrin-! ike repeat sequence), and AH2-R19,R20 
(a 7 spectrin-like repeat sequence). One starting plasmid was pHBMD, a human 
dystrophin cDNA (full-length HDMD, SEQ ID NO:47) with a further deletion of the 
sequences encoded by exons 17-48. The cDNA was cloned into the commercially 
available plasmid vector pTZ19r (MB I Fermentas; Genbank accession number 
Y 1483 5, See Fig.35), into which an EcoRI-Sall adapter (prepared by self-annealing of 
the oligonucleotide 5'-AATTCGTCGACG-3\ SEQ ID NO: 83) had been ligated into 
the the EcoRI site. Base number 1 of the cDNA is immediately 3 1 of the adapter 
sequence, and the cDNA ends at the Xbal site at base 12,100 of SEQ ID NO:l. This 
Xbal site had been ligated into the Xbal site of the plasmid ptZ19r. Another starting 
plasmid is pBSX (SEQ ID NO:46), a modified version of pBluescript KSII+ 
(Stratagene) which is used to make pBSXA (pBSX into which the SV40 
polyadenylation signal (pA) was added). This pA sequence was excised as a 206 bp 
fragment from pCMVfl (Clonetech), blunt-ended with DNA polymerase I, and ligated 
into the blunt-ended Kpnl site of pBSX. Another starting plasmid is pCK3, which is 
pBSX with the 3.3 kb mouse muscle creatine kinase enhancer plus promoter attached 
to the minx intron {See, Hauser et aL, Mol Ther. t 2:16-25, 2000). Another staring 
plasmid is pHDSK, which is pHBMD digested with Kpnl, to remove the dystrophin 
sequences 3' of the internal Kpnl site (base 7,616 of the human dystrophin cDNA 
sequence, SEQ ID NO:l). A further starting vector is p44.1, which is pBluescript KS 
(Stratagene) carrying a human dystrophin cDNA fragment spanning the EcoRI site at 
base 7,002 to the EcoRI site at base 7,875 of the full-length human dystrophin cDNA 
sequence, cloned into the EcoRI site of the vector. Another plasmid employed was 
p30-2, pBluescribe (Stratagene) containing a fragment from the full-length human 
dystrophin cDNA spanning bases 1,455 to the EcoRI site at base 2,647, cloned into 
the EcoRI site of the vector. An additional vector employed was p30-l, pBluescribe 
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cloning site. A second PCR product was generated by amplification of plasmid p44-l 
using primers 5'-CCAGGCTTTACACTTTATGCTTCC-3' (SEQ ID NO:73) and 5'- 
GCACAGATTTCACAGCAGCCTGACCTAGCTC-3' (SEQ ID NO:74). Primer SEQ 
ID NO:74 is the reverse complement of primer SEQ ID NO:75. Primer SEQ ID 
NO:73 corresponds to a sequence in the plasmid vector adjacent to the cloning site. 
The PCR products were then purified by agarose gel electorphoreses, and quantified. 
A recombinant PCR product was then prepared by mixing together 10 ng of each of 
the first two PCR products, then re-PCR amplifying using only primers SEQ ID 
NO:72 and SEQ ID NO-.73. This recombinant PCR product was then digested with 
Nhel and Kpnl, and ligated into Nhel and Kpnl digested pHASK to generate plasmid 
pHBMDAH2 (Nhel cuts at cDNA base 1,519, and Kpnl cuts at base 7,616 of the 
full-length human dystrophin cDNA sequence). pHBMDAH2 was then digested with 
Kpnl and Xbal, and ligated to the KpnI-Xbal fragment from pHBMD (this latter 
fragment contains the full-length human dystrophin cDNA bases 7,616 to 12,100) to 
obtain plasmid pAH2R19. 



Isolation of pAR9R16 

Plasmid p44-l was digested with EcoRI and As P 718 to excise a 610 bp cDNA 
insert, that was ligated into pBSX digested with EcoRI and As P 718, yielding 
pBSX44AE. pBSX44AE was digested with EcoRI and Xbal, and ligated to the 
Nhel-EcoRI cDNA-containing fragment from p30-2, yielding pBSX44AE/30-2NE. 
Plasmid pBSX44AE/30-2NE was linearized by digestion with EcoRI, into which was 
ligated the EcoRI-digested recombinant PCR product AR9-R16. This latter 
recombinant PCR product was generated as follows. Plasmid p30-l was amplified 
with primers SEQ ID NO:72 and 5'-CCATTTCTCAACAGATCTTCCAAAGTCTTG-3 
(SEQ ID NO:77), and plasmid p47-4 was amplified by PCR with primers SEQ ID 
NO:73 and 5'-CAAGACTTTGGAAGATCTGTTGAGAAATGG-3 (SEQ ID N0.76). 
A recombinant PCR product (AR9-R16) was then prepared by mixing together 10 ng 
of each of the first two PCR products, then re-PCR amplifying using only primers 
SEQ ID NO:72 and SEQ ID NO:73. This recombinant PCR product was then 
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digested with EcoRI, and ligated into EcoRI digested pBSX44AE/30-2NE to generate 
plasmid pR9R16int. Plasmid pR9R16int was digested with Ncol and Asp7I8, and the 
3 kb cDNA fragment was isolated and ligated into Ncol and Asp718 digested pHASK 
to generate pAR9R16. 

Isolation of pARlR24 

Plasmid p9-7 was PCR amplified with PCR primers 
5'-AGTGTGGTTTGCCAGCAGTC (SEQ ID NO:80) and 

5^CAAAGTCCCTGTGGGCGTCTTCAGGAGCTTCC-3' (SEQ ID NO:79). Plasmid 
p63-l was PCR amplified with primers 5* 

GGAAGCTCCTGAAGACGCCCACAGGGACTTTG-3' (SEQ ID NO:78) and 
5 , -TGGTTGATATAGTAGGG.CAC-3' (SEQ ID NO:81). A recombinant PCR product 
(AR1-R24) was then prepared by mixing together 10 ng of each of the first two PCR 
products, then re-PCR amplifying using only primers SEQ ID NO:80 and SEQ ID 
NO;8l. This recombinant PCR product was then digested with SexAI and PpuMI, and. 
ligated into SexAI and PpuMI digested pHBMD to generate plasmid pARlR24. 

Isolation of pAH2-H3 

This clone was prepared exactly as pAH2-R19, except that primer 5'- 
CAGATTTCACAGGCTGCTCTGGCAGATTTC-S' (SEQ ID NO:82) was used in 
place of primer SEQ ID NO:74, and primer 

5'-GAAATCTGCCAGAGCAGCCTGTGAAATCTG-3' (SEQ ID NO:84) was used in 
place of primer SEQ ID NO:75. 

Isolation of AH2-R19,R20 

This clone was generated from clone pAH2R19 as follows. Plasmid p44-l was 
amplified with primers SEQ ID NO:72 and 5*- 

TGAATCCTTTAACATAGGTACCTCCAACAT-3' (SEQ ID NO:85). Plasmid 63-1 
was amplified with primers 5 l -ATGTTGGAGGTACCTATGTTAAAGGATTCA-3' 
(SEQ ID NO:86) and SEQ ID NO:8L The PCR products were then purified by 
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agarose gel electorphoreses, and quantified. A recombinant PCR product was then 
prepared by mixing together 10 ng of each of the first two PCR products, then re-PCR 
amplifying using only primers SEQ ID NO:72 and SEQ ID NO:81. This product was 
digested with Asp718 and BstXI, and ligated into Asp718 and BstXI digested pHBMD 
5 generating clone pBMDAR20. The Asp718-Xbal cDNA-containing fragment from 
pBMDAR20 was isolated and ligated into Asp7l8 and Xbal digested pAH2R19 to 
generate pAH2-R19,R20. 

EXAMPLES 
Testing Truncated Dystrophin in mdx Mice 

10 This example describes the generation of transgenic mdx mice expressing 

truncated dystrophin cDNA (see above), and testing these mice in various ways to 
q determine various measurable muscle values. A variety of dystrophin expression 



fU 



CO 



cassettes (Fig. 27) were used to generate transgenic mice to test their functional 



Mi capacity in alleviating muscular dystrophy on the dystrophin null mdx background. 

w 

£15 Figure 27 depicts the truncated dystrophin cDNA sequences tested, all of which were 



linked to an regulatory regions, a minx intron, and the SV40 polyadenylation sequence 



& (the 4-repeat constructs employed the HSA actin promoter, See Crawford et aL, J. 

Q Cell. BioL t 150:1399, 2000; and the remaining sequences employed an MCK enhancer 

P and promoter, see Niwa et aL, Genes Dev. 4:1552, 1990). Each of these constructs 

□20 was released by digestion from plasmid hosts, were gel purified, and used to generate 
transgenic mice. 

Excised expression cassettes injected into wild type C57B1/10 x SJL/J F2 
hybrid embryos, and F° mice were screened by PCR analysis of DNA isolated from 
tail snips. Positive F° mice were backcrossed onto the C57Bl/10m<& background, and 
25 individual mouse lines were tested for dystrophin expression by immunofluorescent 
analysis with dystrophin antibodies for of expression in skeletal muscle fibers. Lines 
that displayed uniform expression of dystrophin in muscle fibers were selected for 
further analysis. These lines were further backcrossed onto the mdx mouse 
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background before analysis of dystrophin expression, muscle function and morphology. 

A. Truncated dystrophin cDNAs are expressed at various levels in muscles 
of transgenic mdx mice. 

Muscle. extracts were analyzed by western (immuno) blot analysis to determine 
the amount of dystrophin made in different muscles of the transgenic mdx mice. For 
these studies, total protein was extracted from the quadriceps and diaphragm muscles 
of control and transgenic mice, and protein concentrations were determined using the 
Coomassie Plus Protein Assay Reagent (Pierce). One hundred micrograms of each 
sample was electrophoresed on a 6% polyacrylamide/SDS gel (29.7:0. 3/acryl:bis), 
transferred for 2 hours at 75 volts onto Biotrace Nitrocellulose (Gelman Science) in 
IX Tris-Glycine, 20% methanol, 0.05% SDS, using a wet-transfer apparatus (Hoefer). 
Membranes were blocked in 10% non-fat dry milk, 1% normal goat serum, and 0.1% 
Tween-20, and hybridized with DYS1 (Novacastra) at a 1/1000 dilution for 2 hours at 
room temperature, washed, and then probed with horse radish peroxidase conjugated 
anti-mouse antibodies at a 1/2,000 dilution (Cappel). Blots were developed using the 
ECL chemiluminescence system (Amersham). All incubations contained 1% normal 
goat serum and 0.1% Tween-20. The results of the western blot indicated that R9-R16 
was poorly expressed in this line of mice, especially in the diaphragm, and that H2-H3 
was very poorly expressed in the diaphragm. 

B. Truncated dystrophin cDNAs confer various degrees of protection 
on muscles of transgenic mdx mice. 

Various muscle groups from the different lines of transgenic mice expressing 
truncated dystrophins were examined for morphological abnormalities, and for 
expression of dystrophin by indirect immunofluorescence (IF) in individual fibers. IF 
analysis was performed as follows. Skeletal muscle was removed from control and 
transgenic animals, cut into strips, embedded in Tissue-tek OCT mounting media 
(Miles, Inc.), and frozen quickly in liquid nitrogen-cooled isopentane. Seven 
micrometer sections were* blocked with 1% gelatin in KPBS for 15 minutes, washed in 
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KPBS + 0.2% gelatin (KPBSG), and incubated for 2 hours in KPBSG + 1% normal 
goat serum with affinity-purified dystrophin antibody 18-4 (Cox et aL, Nature, 
364:725-729, 1993) at a dilution of 1/1000. After washing, the slides were incubated 
for I hour with either biotin-labeled goat anti-rabbit polyclonal antibodies (Pierce), 
washed again, and incubated with FITC (fluorescein tsothiocynate)-CQnjugated 
streptavidin. After a final wash, Vectashield (Vector Laboratories, Inc.) with DAPI 
was applied and sections were photographed through a dual bandpass filter under 40X 
magnification using a Nikon El 000 microscope. 

Morphological analysis of the muscles was performed as follows. Muscle 
groups from among the following types were chosen for analysis: Quadriceps (Quad), 
soleus, extensor digitorum longus (EDL), tibialis anterior (TA), and diaphragm 
muscles. Selected muscles were removed from mice, frozen in liquid nitrogen cooled 
O.C.T. embedding medium (Tissue-Tek), and cut into 7 \im sections. After fixing in 
3.7% formaldehyde, sections were stained in hematoxylin and eosin-phloxine. Stained 
sections were imaged with a Nikon El 000 microscope and photographed. 

The results of this analysis show that micro-dystrophin expression (AR4R23 
transgene) in the diaphragm prevents the onset of muscular dystrophy in mdx mice. In 
particular, micro-dystrophin transgenic and wild-type C57B1/10 diaphragm sections 
stained with hematoxylin and eosin (H&E) show morphologically healthy muscle . 
without areas of fibrosis, necrosis, mononuclear cell infiltration, or centrally located 
nuclei. Conversely, the mdx diaphragm displays a high level of dystrophic 
morphology by H&E. Also, immuno -fluorescence, using anti-dystrophin polyclonal 
primary antisera, demonstrates that micro-dystrophin transgenes are expressed at the 
sarcolemmal membrane in a similar fashion to that of wild-type dystrophin, while mdx 
mice do not express dystrophin. 

H&E staining also shows that truncated dystrophins with 8 or 16 spectrin-like 
repeats have varying abilities to prevent dystrophy in the diaphragm of transgenic mdx 
mice. The H2R19 maintains normal muscle morphology that is not different from 
wild-type. C57B 1/10 muscle. The AH2R19 muscle displays a very low percentage of 
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centrally nucleated fibers, while the AH2-R19,R20 and AR9-16 constructs display 
percentages intermediate between AH2-R19 and mdx (see Fig. 28). The mdx 
diaphragm had a large number of centrally nucleated fibers, many necrotic fibers, and 
large areas of mono-nuclear cell. infiltration and fibrosis. 

The results also show that quadriceps muscle fibers expressing 
micro-dystrophin transgene (AR4R23 transgene) display normal morphology and 
exclude Evans Blue Dye. Micro-dystrophin transgenic mdx or C57B1/10 quadriceps 
sections stained with hematoxylin and eosin (H&E) display morphologically healthy 
muscle without areas of necrosis, fibrosis, mononuclear cell infiltration, or 
centrally-located nuclei, as opposed to sections of mdx muscle. The high abundance" 
of central nuclei and mononuclear immune cell infiltration are evidence of muscle cell 
necrosis. Immunofluorescence results indicate that micro -dystrophins display a 
subsarcolemmal expression pattern like that of -wild-type dystrophin, while mdx mice 
do not express dystrophin, Evans Blue Dye (EBD) uptake is an indication of a 
damaged myo fiber. For analysis of EBD uptake, mice were tail vein injected with 150. 
ul of a solution containing 10 mg/ml Evans blue dye in PBS (150 mM NaCl, 50 mM 
Tris pH 7.4). . After three hours, the animals were euthanized and mouse tissues were 
either fixed in 3.7% fonnaldehyde/0. 5% glutaraldehyde to observe gross dye uptake, or 
frozen unfixed in O.C.T. embedding medium. To examine Evans blue uptake by 
individual fibers, 7 urn thick frozen sections were fixed in cold acetone and analyzed 
by fluorescence microscopy. The results of this testing indicate that fibers expressing 
micro-dystrophin or wild-type dystrophin exclude EBD, and that damaged mdx muscle 
cell membranes are permeable to Evans Blue dye. 

A hallmark of dystrophy in mdx mice is the presence of large numbers of 
centrally-nucleated muscle fibers, reflecting cycles of fiber degeneration and 
regeneration. To estimate the degree of myofiber regeneration occurring in the 
transgenic mice, centrally-nucleated fibers were counted from quadriceps muscles in 
age-matched wild-type, mdx, and transgenic mdx mice (Fig. 28). To determine the 
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percentage of fibers containing central nuclei, the number of muscle fibers with 
centrally-located nuclei was divided by the total number of muscle fibers. 

Expression of 8 or 4 repeat micro-dystrophin tranagenes on the mdx 
background significantly reduces the percentage of fibers with centrally-located nuclei 
to wild-type or near wild-type levels (Fig. 28). Dystrophin molecules with zero 
repeats are unable to correct the mdx phenotype by this assay. The best constructs 
were observed to be the 8 repeat H2-R19 and the 4 repeat R2-R23 constructs. Greater 
percentages of centrally nucleated fibers were observed in mice expression the exon 
17-48 deletion, the 4 repeat R2R21 construct, the 7 repeat H2R19,R20 construct, the 
16 repeat R9R16 construct, and the zero repeat R1R24 construct (Fig. 28). The results 
from the R9R16 construct likely do not reflect the full functional capacity of the 16 
repeat dystrophin since this line of mice expressed very low levels of the truncated 
dystrophin protein. All other muscles expressed -levels of dystrophin that have been 
shown to be capable of preventing dystrophy if the expressed protein is functional 
(Phelps et al, Hum Mol Genet; 4:1251-1258, 1995). 

The functional capacity of the truncated dystrophins was also assessed by 
measuring muscle contractile properties in the transgenic mdx mice. Contractile 
properties of muscles from transgenic mice were compared with those of C57B1/10 
wild type and mdx mice. The samples included 4-8 muscles each from the tibialis 
anterior (TA), extensor digitorum longus (EDL) or diaphragm. Mice were deeply 
anesthetized with avertin and each muscle was isolated and dissected free from the 
mouse. After removal of the limb muscles, the mice were euthanized with the 
removal of the diaphragm muscle. The muscles were immersed in a bath filled with 
oxygenated buffered mammalian Ringer's solution (137 mM NaCl, 24 mM NaHC0 3 , 
11 mM glucose, 5 mM KC1, 2 mM CaC^, 1 mM MgS0 4 , 1 mM NaH 2 P0 4 , and 0.025 
mM rubocurarine chloride, pH 7,4). For each muscle, one tendon was tied to a 
servomotor and the other tendon to a force transducer. Muscles were stretched from 
slack length to the optimal length for force development and then stimulated at a 
frequency that produced absolute isometric tetanic force (mN). Following the 
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measurements of the contractile properties, the muscles were removed from the bath, 
blotted and weighed to determine muscle mass. Specific force (kN/m2) was calculated, 
by dividing absolute force by total fiber cross sectional area. 

Figure 29 shows that the 8 repeat dystrophin encoded by H2-R19 supports 
normal force development in both the diaphragm (Fig. 29a) and EDL muscle (Fig. 
29b). In contrast, previous studies showed that the exon 17-48 construct, which 
encodes a dystrophin with 8.25 spectrin-like repeats, supports only 90-95% of normal 
force development in the diaphragm (Phelps et aL, Hum Mol Genet, 4:1251-1258, 
1995). The 8 repeat dystrophin lacking a central hinge (H2-H3), and the 7 repeat 
dystrophin (H2-R19,R20) both fail to support significant force generation compared 
with dystrophic mdx muscles. The results from the R9-R16 construct likely do not 
reflect the full functional capacity of the 16 repeat dystrophin, since this line of mice 
expressed very low levels of the truncated dystrophin. 

Figure 30 shows that the micro-dystrophin transgenic mdx mice develop less 
specific force than do C57B1/10 mice in the TA, but near wild-type levels in the 
diaphragm. Micro-dys 1 and -2 refer to transgenes AR4-R23, and AR2-R21, 
respectively. Figure 30A shows that C57B1/10 mice display significantly higher 
specific force than both transgenic lines and mdx mice in the tibialis anterior (TA) 
muscle. Data are presented as means ± standard error of the means (s.e.m.) with each 
baT representing 6 to 8 TA muscles. ANOVA statistical testing was performed. (* 
indicates significance from C57BV10, p<0.01; s indicates significance from. C57B 1/10, 
p<0.05). Figure 30B shows that mice expressing Micro-dys 1 develop wild type levels 
of specific force in the diaphragm, while mice expressing Micro-dys 2 develop ~22% 
less specific force by the same assay when compared with C57B1/10. Both lines of 
mice develop more specific force than mdx mice in the diaphragm. Data are presented 
as. the percentage of wild type. 

Dystrophic mice are susceptible to contraction-induced injury (Petrof, et aL, 
Proc. Natl. Acad. Sci. USA. 90:3710-3714, 1993). In this part of the example tested 
whether the 4 repeat dystrophin clones would protect muscles of transgenic mdx mice 
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from contraction induced injuries. To test contraction-induced injury, an experimental 
protocol consisting of two muscle stretches was performed in live, anesthetized 
animals. The distal tendon of the TA was cut and secured to the lever arm of a 
servomotor that monitors position and force produced by the muscle. Stimulation 
5 voltage and optimal muscle length (L 0 ) for force production were determined. The 

muscle was maximally stimulated and then stretched 40% greater than L 0 (LCI) for 
300 milliseconds. A second lengthening contraction was performed 10 seconds later 
(LC2). The maximum force that the muscle was able to produce after each stretch 
was measured and expressed as a percentage of the force produced before stretch. 
10 Mdx mice expressing micro -dystrophins were significantly protected from the dramatic 
force deficit produced after a lengthening contraction compared with mdx mice (Fig. 
31). Micro-dys 1 and -2 refer to transgenes AR4-R23, and AR2-R21, respectively. 

£H ■ 

Q Furthermore, there was no significant difference between either micro-dystrophin 

JJj construct studied in this assay and C57B1/10 mice following the second, most 

Kl5 damaging lengthening contraction. Data are presented as means ± s.e.m. with each bar 

£0 

representing between 6 and 8 TA muscles from 9-11 week old mice. 



® 

9 

K C. Truncated 4 repeat dystrophin cDNAs restore the ability to run 

□ 

□ long distances to mdx mice. 

^ We have observed that mdx mice are not able to run for long distances on a 

Q20 treadmill, as compared to wild-type mice (see below). Therefore, mice expressing four 
repeat dystrophins were compared with wild-type and mdx mice for ability to run for 
extended times on a treadmill. The exercising protocol utilized a six lane, enclosed 
treadmill with a shock grid to allow forced running at a controlled rate. C57B1/10, 
C57B1/6 x SJL Fl, mdx or transgenic mdx mice were run at a 15 degree downward 
25 angle to induce damaging eccentric muscle contractions. Mice were given a 1 5 minute 

acclimation period prior to exercise, and then ran at 10 meters/minute with a 
subsequent 5 m/min increase in rate every 10 minutes until exhaustion. Exhaustion 
was determined to be the time at which a mouse spent more than 5 seconds sitting on 
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the shock grid without attempting a re-entry to the treadmill. As shown in Fig 32, 
both lines of four repeat transgenic mice ran significantly farther than mdx mice. 
Micro-dys 1 and -2 refer to transgenes AR4-R23, and AR2-R21, respectively. 
Micro-dystrophin transgenic mice are a genetic mixture of C57B1/6 x SJL, and 
C57BI/10 strains, and ran an intermediate distance between the two wild-type lines. 
Data are presented as means ± s.e.m. ANOVA statistical analyses were performed. (* 
indicates values significantly different from mdx line, p<0.01; s' indicates values 
significantly different from mdx line, p<0.05). 

D. Micro-dystrophin transgenic mdx mice do not display hypertrophy 

As a way to measure the functional capacity of the four-repeat dystrophins, we 
weighed both whole mice and dissected tibialis anterior muscles from age matched 
transgenic and control mice. The results shown in Fig. 33 show that the 
micro-dystrophin transgenic mdx mice do not display the muscle hypertrophy normally 
observed in mdx mice. Figure 33 A shows that three month old micro-dystrophin 
transgenic mdx mice weighed significantly less than age-matched mdx control mice. 
Figure 1 1 B shows that tibialis anterior (TA) muscle masses in mdx mice were 
significantly higher than control muscle masses in C57B1/10 and in both lines of mdx 
mice expressing different micro-dystrophin transgenes. Data are presented as means ± 
s.e.m. with each bar representing between 3 and 4 mice. ANOVA statistical analyses 
were performed (* indicates difference from mdx line, p<0.01; Y indicates difference 
from C57B1/10 line, p<0.01; s indicates difference from C57B1/10 line, p<0.05). . 
Micro-dys 1 and -2 refer to transgenes AR4-R23, and AR2-R21, respectively. 

EXAMPLE 6 

Microdystrophin-containing Adeno-associated Viral Vectors 

This example describes a construct that could be made in order to allow adeno- 
associated virus to express a mini-dystrophin peptide in a target muscle cells. Fig. 34 
shows a schematic illustration of a plasmid vector containing the adeno-associated 
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virus inverted terminal repeats (AAV-ITRs), the muscle promoter plus enhancer 
fragment known as CK6.(SEQ ID NO:6l, the AR2-R21 four repeat dystrophin cDNA 
(SEQ ID NO:40) with a further deletion of sequences encoded on exons 71-78, plus a 
195 base pair SV40 polyadenylation signal that would have a total insert size of 
approximately 4.7 kb. The cloning capacity of adeno -associated viral vectors is 
approximately 4.9 kb. As such, the construct could be efficiently packaged into AAV 
viral particles (e.g. this plasmid construct could be used to transfect cells such that 
AAV expressing mini-dystrophin peptide is expressed). These AAV then, for 
example, may be administered to a subject with DMD or BMD (i.e. gene, therapy to 
correct a muscle deficiency in a subject). 

All publications and patents mentioned in the above specification are herein 
incorporated by reference. Various modifications and variations of the described 
method and system of the invention will be apparent to those skilled in the art without 
departing from the scope and spirit of the invention. Although the invention has been 
described in connection with specific preferred embodiments, it should.be understood 
that the invention as claimed should not be unduly limited to such specific 
embodiments. Indeed, various modifications of the described modes for carrying out 
the invention which are obvious to those skilled in material science, chemistry, and 
molecular biology or related fields are intended to be within the scope of the following 
claims. 
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CLAIMS 

We claim: 

1. A composition comprising nucleic acid encoding a mini-dystrophin 
peptide, wherein said mini-dystrophin peptide comprises a spectrin-like repeat domain, 
and wherein said spectrin-like repeat domain consists of n spectrin-like repeats, 
wherein n is an even number less than 24, 

2. The composition of Claim 1, wherein said mini-dystrophin-peptide is 
capable of altering a measurable muscle value in a DMD animal model by at least 
20% of the wild type value. 

3. The composition of Claim 1, wherein said mini-dystrophin peptide is 
capable of altering a measurable muscle value in a DMD animal model to a level 
similar to the wild-type value. 

4. The composition of Claim 1, wherein n is a multiple of 4. 

5. The composition of Claim 1, wherein n is 4. 

6. The composition of Claim 1, wherein said nucleic acid comprises an 
expression vector. 

7. The composition of Claim 1, wherein said nucleic acid comprises 
spectrin-like repeat encoding sequences. 

8. The composition of Claim 7, wherein said spectrin-like repeat encoding 
sequences are precise spectrin-like repeat encoding sequences. 
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9. The composition of Claim 1 , wherein said nucleic acid comprises an 
actin-binding domain encoding sequence. 

10. The composition of Claim 9, wherein said actin binding domain 
comprises at least a portion of SEQ ID NO:6, 

11. The composition of Claim 1, wherein said nucleic acid comprises a p- 
dystroglycan binding domain. 

12. The composition of Claim 11, wherein said (3-dystroglycan binding 
domain comprises at least a portion of a dystrophin hinge 4 encoding sequence, and at 
least a portion of a dystrophin cysteine-rich domain encoding sequence. 

13. The composition of Claim 7, wherein said spectrin-like repeat encoding 
sequences are selected from the group consisting of SEQ ID NOS;8-10, 12-27, and 29- 
33. 

14. The composition of Claim 1, wherein said nucleic acid contains less 
than 75% of a wild type dystrophin 3' untranslated region. 

15. The composition of Claim 1, wherein said mini -dystrophin peptide 
further comprises a substantially deleted dystrophin C- terminal domain. 

16. The composition of Claim 1, wherein said nucleic acid sequence 

* 

contains less than 50% of a dystrophin 3' untranslated region. 

17. A method of expressing a mini -dystrophin peptide in a target cell, 
comprising; 

a) providing; 
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i) a vector comprising nucleic acid encoding a mini- 
dystrophin peptide, wherein said mini-dystrophin peptide comprises a 
spectrin-like repeat domain, and wherein said spectrin-like repeat 
domain consists of n spectrin-like repeats, wherein n is an even number 
less than 24, and 

ii) a target cell, and 

b) contacting said vector with said target cell under conditions 
such that said mini-dystrophin peptide is expressed in said target cell. 

18. The method of Claim 17, wherein said mini-dystrophin peptide further 
comprises a substantially deleted dystrophin C-terminal domain. 

19. The composition of Claim 17, wherein said nucleic acid comprises 
spectrin-like repeat encoding sequences. 

20. The method of Claim 19, wherein said spectrin-like repeat encoding 
sequences are precise spectrin-like repeat encoding sequences. 

21. The composition of Claim 17, wherein said mini-dystrophin-peptide is 
capable of altering a measurable muscle value in a DMD animal model by at least 
20% of the wild type value. 

22. The composition of Claim 17, wherein said mini-dystrophin peptide is 
capable of altering a measurable muscle value in a DMD animal model to a level 
similar to the wild-type value. 

23. A composition comprising nucleic acid, wherein said nucleic acid 
encodes a mini-dystrophin peptide, and wherein said mini-dystrophin peptide 
comprises a substantially deleted dystrophin C-terminal domain. 
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24. The composition of Claim 23, wherein said substantially deleted 
dystrophin C-terminal domain is less than 40% of a wild type dystrophin C-terminal 
domain. 

25.. The composition of Claim 23, wherein said mini-dystrophin-peptide is 
capable of altering a measurable muscle value in a DMD animal model by at least 
20% of the wild type value. 

26. The composition of Claim 23, wherein said mini-dystrophiri peptide is 
capable of altering a measurable muscle value in a DMD animal model to a level 
similar to the wild-type value. 

27. The composition of Claim 23, wherein said nucleic acid comprises an 
actin-binding domain encoding sequence. 

28. The composition of Claim 23, wherein said nucleic acid comprises a p- 
dystroglycan binding domain. 

29. The composition of Claim 23, wherein said nucleic acid comprises at 
least 2 spectrin-like repeat encoding sequences. 

30. The composition of Claim 23, wherein said nucleic acid comprises viral 
nucleic acid. 
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FIGURE 1 (Human Dystrophin cDNA, Ace No. M18533, SEQ ID N0:1) 



.1 gggattccct cactttcccc ctacaggact 
61 aaacgaatag gaaaaactga agtgttactt 
121 tcattgtttt taagectact ggagcaataa 
181 atcgctgcct tgatatacac ttttcaaaat 
241 tgaaagagaa gatgttcaaa agaaaacatt 
301 gtttgggaag cagcatattg agaacctctt 
361 agacctcctc gaaggcctga cagggcaaaa 
421 tcatgccctg aacaatgtca acaa^gcact 
481 agtgaatatt ggaagtactg acat'egtaga 
541 ttggaatata atcctccact ggcaggtcaa 
601 gcaacaaacc aacagtgaaa agattctcct 
661 tccacaggtt aatgtaatca acttcaccac 
721 tctcatccat agtcataggc cagacctatt 
781 agccacacaa cgactggaac atgcattcaa 
841 actactcgat cctgaagatg ttgataccac 
901 catcacatca ctcttccaag ttttgectea 
961 ggaaatgttg ccaaggccac • ctaaagtgac 
1021 aatgeactat tctcaacaga teaeggtcag 
1081 ccctaagcct cgattcaaga getatgecta 
1141" ccctacacgg agcccatttc cttcacagca 
1201 cagttcattg atggagagtg aagtaaacct 
1261 attatcgtgg cttctttctg ctgaggacac 
1321 tgtggaagtg gtgaaagacc agtttcatac 
1381 ccatcagggc cgggttggta- atattctaca 
1441 attatcagaa gatgaagaaa ctgaagtaca 
1501 ggaatgeetc agggtagcta gcatggaaaa 
1561 tctccagaat cagaaactga aagagttgaa 
1621 aaggaaaatg gaggaagagc ctcttggacc 
1681 acaacataag gtgcttcaag aagatctaga 
1741 tcacatggtg gtggtagttg atgaatctag 
1801 acaacttaag gtattgggag atcgatgggc 
1861 ggttctttta caagacatcc ttctcaaatg 
1921 tagtgcatgg ctttcagaaa aagaagatgc 
1981 agatcaaaat gaaatgttat caagtcttca 
2041 aaagaaaaag caatccatgg gcaaactgta 
2101 gaagaataag tcagtgaccc agaagaegga 
2161 ggataattta gtccaaaaac ttgaaaagag 
2221 cactcagcca tcactaacac agacaactgt 
2281 ggaacagatc ctggtaaagc atgetcaaga 
2341 gaggcagatt actgtggatt ctgaaattag 
2401 tcacagctgg attactcget cagaagctgt 
2461 gaaggaaggc aacttctcag acttaaaaga 
2521 tgagaagttc agaaaactgc aagatgccag 
2581 ggtgaatgag ggtgttaatg cagatagcat 
2641 gtggatcgaa ttctgccagt tgctaagtga 
2701 catcatcgct ttctataatc agctacaaca 
2761 ctggttgaaa atccaaccca ccaccccatc 
2821 aatttgtaag gatgaagtca accggctatc 
2881 aattcaaagc atagecctga aagagaaagg 
2941 tgtggccttt acaaatcatt ttaagcaagt 
3001 gctacagaca atttttgaca* ctttgccacc 
3061 caggacatgg gtccagcagt cagaaaccaa 
3121 ctatgaaatc atggagcaga gaetegggga 
3181 gcaacaaagt ggectatact atctcagcac 
3241 ctctgaaatt ageeggaaat atcaatcaga 
3301 gctctcctcc cagctggttg agcattgtca 
3361 aaaaattcag aatcacatac aaaccctgaa 
3421 gaaggaggaa tggcctgccc ttggggattc 
3481 cagactttta gtcagtgata ttcagacaat 
3541 tgggcagaag ataaagaatg aagcagagee 
3601 caaagaactt aacactcagt gggatcacat 



cagatctggg aggcaattac etteggagaa 
tttttaaagc' tgctgaagtt tgttggtttc 
agtttgaaga acttttacca ggtttttttt 
gctttggtgg gaagaagtag aggactgtta 
cacaaaatgg gtaaatgeae aattttctaa 
cagtgaccta caggatggga ggcgcctcct 
actgecaaaa gaaaaaggat ccacaagagt 
gcgggttttg cagaacaata atgttgattt 
tggaaatcat aaactgactc ttggtttgat 
aaatgtaatg aaaaatatca tggctggatt 
gagctgggtc cgacaatcaa ctegtaatta 
cagctggtct gatggcctgg etttgaatge 
tgactggaat agtgtggttt gccagcagtc 
catcgccaga tatcaattag gcatagagaa 
ctatccagat aagaagtcca tcttaatgta 
acaagtgagc attgaageca tccaggaagt 
taaagaagaa cattttcagt tacatcatca 
tctagcacag ggatatgaga gaacttcttc 
cacacaggct gcttatgtca ccacctctga 
tttggaagct cctgaagaca agtcatttgg 
ggaccgttat caaacagctt tagaagaagt 
attgeaagea caaggagaga tttctaatga 
tcatgagggg tacatgatgg atttgacagc 
attgggaagt aagctgattg gaacaggaaa 
agagcagatg aatctcctaa attcaagatg 
acaaagcaat ttacatagag ttttaatgga 
tgactggcta acaaaaacag aagaaagaac 
tgatcttgaa gacctaaaac gecaagtaca 
acaagaacaa gtcagggtca attctctcac 
tggagatcac geaactgetg ctttggaaga 
aaacatctgt agatggacag aagaccgctg 
geaaegtett actgaagaac agtgcctttt 
agtgaacaag attcacacaa ctggctttaa 
aaaactggcc gttttaaaag eggatctaga 
ttcactcaaa caagatcttc tttcaacact 
agcatggctg gataactttg cccggtgttg 
tacagcacag atttcacagg ctgtcaccac 
aatggaaaca gtaactaegg tgaccacaag- 
ggaacttcca ccaccacctc cccaaaagaa 
gaaaaggttg gatgttgata taactgaact 
gttgcagagt cctgaatttg caatctttcg 
aaaagtcaat gecatagage gagaaaaagc 
cagatcagct caggccctgg . tggaacagat 
caaacaagcc tcagaacaac tgaacagccg 
gagacttaac tggctggagt atcagaacaa 
attggagcag atgacaacta ctgctgaaaa 
agagecaaca gcaattaaaa gtcagttaaa 
aggtcttcaa cctcaaattg aacgattaaa 
acaa.ggaccc atgttcctgg atgeagaett 
cttttctgat gtgeaggeca gagagaaaga 
aatgegctat caggagacca tgagtgccat 
actctccata cctcaactta gtgtcaccga 
attgeagget ttacaaagtt etctgeaaga 
cactgtgaaa gagatgtcga agaaagegee 
atttgaagaa attgagggac gctggaagaa 
aaagctagag gagcaaatga ataaactccg 
gaaatggatg gctgaagttg atgtttttct 
agaaattcta aaaaagcagc tgaaacagtg 
tcagcccagt ctaaacagtg tcaatgaagg 
agagtttget tcgagacttg agacagaact 
gtgccaacag gtctatgeca gaaaggaggc 



FIGURE 1 (cont) 



372 6 1 atSlS'HS^S ttlTolltlt -tccagaaa.gatctatcag agatgcacga 

3781 tgSlttlcg-SSSS"? aagagatgaa altlaltllt tltll^ aaac ' cca 9 a 
3841 agcgaaaota aaaetect-f-* I 9agagctaaa gaagaggccc aacaaaaaga 

3901 agcicaagag gccttallaa ffiEfJ 9 * aaa J«9 t fltc atagctcaag ctccacctgt 
3961 cactagcltq MtooSS*-? *-H 9 aac "? a aactctaacc accaactacc agtggctctq 
4021 at t ttgllltttl ISlfZtSt Sgaagaagtt tgggcatgtt g^I?gagtt 

4081 cactgaaaac attcctggcg qaqcKf I-^ 9 " 95 a 9 aa "ta aacttaalac 
4141 tttgatgcga cattcaaa™ Itt^ll aatct ftgag gtgctagatt cacttgaaaa 
4201 agatgglgga g'oatggatg agc'aatca 3 ?S'2« ttC ?S atatt 99= a c agaccctaac 
4261 gagggaacta catcaaa*™ j^;! tgaggaactt gagacattta attctcgttq 

4 321 tglclaggag tltltlllll cc^aS? gcaaaa 9ttg cttgaacaga gcatcclgtl 
4 381 gcagttggca oottatat?™ ^Hf? fc 5 aatccaggag tccctcacat tcattgacaa 
4441 laaLtl^a 3 ?ctgaUtgf-caIgtcatIa SaK"?? 1 caaat ^<*c aggaafccca 
4501 tcaggggaag gaggctgccc aatgagtc?? g^ctclolt 3 a 9 aaa «t aa 
4561 acaagatgtc tccatgaaqt ttcqatr*7; llH, 9 9atgttgcac agaaaaaatt 
4 621 acaagaaagt aaqatqattr t^ffJ^? t ccagaaacca gccaattttg agctgcgtct 

^ee/gagtgtggL' «|SfS," ta a c 9 a a g?cala ?c?Sifcft - ^r?' 9 "! ^alaSS 
4741 tctgagtgaa gtglagtctq aaStooaaa? ^;!! a tgtgtgaact tgtataaaag 
4801 gaaaaagcag aclgalaat? IcltEt^t ?gtgataaag actggacgtc agattgtaca 
4861 ttataatgag ctgScat ' a^af 3 .?™ * gat 9 aaa 9 a gtaacagctt tgaaattgca 
4921 attgtcccgt aagltgcgaa aa^aat*.?* tTlt*"** ca 9«99aga aatgcttgaa 
4 961 tatggaattg aclaagagat cfqcaaUaa tt^t^ 9aatggctgg cagctacaga 
5041 tgcctggggi aaggctactc aaaafl^ff a 99 a atgcct agtaatttgg attctgaagt 
5101 cacaglggta ggl|aggcct tqatalcf^ £?a?aaacag aa 99tgcacc tgaagagtlt 
5161 taaactcagt cttctalata nttltl^l "tgggcaag aaggagacgt tggtggaaga 
5221 aaatcttt?g ttgga^cc gaSt ScUtt fS-WS 
5281 cacaaagtgg atcattcagg ctqacacact tttnt^H ^ccagaatg tggaccacat 
5341 gcaaaaagaa gacgtgcttl a«"tttM* ™!?!" tg f a tca 9 a 9aaaa agaaacccca 
5401 ggactctaca cgtgaccaag cfqcaaact 3 f,^**^ 9 aat 9 aca tac gcccaaaggt 
5461 attagtagag cccLaatc? c^agc'caa llalcaaltt atlTritZ act 9«ggaa 
5521 taagactgga aaggcctcca ttZrttrntt f ? " 9 ca gccattt cacacagaat 
. 5581 aaaattgctt gallcactqa aoactaa 93 ? ?9aattggag cagtttaact cagatataca 
5641 cttcaataaa gatatqaato 2 9 2^fff at tca ?cagggg gtgaatctga aagaggaaga 
5701 agacaactta ^acalagal ^clcagatS UtS? 9 ^ aaa 9 aa ttgt tgLSga^ 
5761 gcagctgtta cagacaaaac atiatSff? ^! aaa ?f ga 9aggaaataa agataaaaca 
5821 ggctctagaa atttctcatc w"*otS££ "f 9 9 attt 9 aggtctcaaa glagaaaaaa 
5881 atgcttggat gacattgaaa lllUSlS fflS "ggctgatg atclcctgaa 
5941 aataaaggaa attqatcqoo ca gcctacct gagcccagag atgaaaggaa 

6001 gcaagctgag ggct^qtcta S 9aagaaagag gagctgaltg cagtgcgtaq 

6061 gcteagclal SctS2c«o aff?^!" 0 c 9« a tggca gtggagccaa ctcagatccl 
6121 tgcaclaatt clcaotgtll Itct^TJ caaattt 9Ct cagtttcgaa gaetSSactt 
6181 ggaaatttct tatg^qcct? ltTrt?:ttt Z^Ji^ a tgactgaag acatqccttt 

630 a * tfl 9 aa 9tg gaacalcttc tcaatgct" toac?? 33 ^ %t? t9tet cacaa 9ccct 
6301 ctttaagcaa gaggaqtctc 'toa«n2-*»r r 9acctctgt gctaaggact ttgaagatct 
6361 gattgacatt ^?Ia?agcI alaa^cagc Igc^'ocla ^r 38 " 33 9c?caggtcg 
Al\ 99tgaagcta caggaagctc tctcccao?f t^tiZ 9 a a 9tgcaacgc ctgtggaaag 
6481 gtacaaggac cglLagggc gftttgalaq atctq^ 3 ^Waaaaag-ttlaSaaat 
6541 tgatataaag atatttaatc agtqq?S f™^Sf 9 aaat 99C9gc gttttcatta 
6601 aattcctgag aattgggaac Itgctaaa'a cllJaZFJ "fttctca gaaagacaca 
6661 cattgggcag cggcllactg ttqtcaaaar ah 3 ^??^ cttaa 99aac tccaggatgg 
6721 gcaatcctca aaaacagatl cSotatt?? tgCa act 999gaag aaatiattS 

6781 gtggcaggag gtctgclaac fgctqtcloa rTnll***™ '^gaagcc tgaatctgcg 
«nf tatcttgtca gaatttcafa gfgatttafa SfaSfSf? a ?9?tagaag alcaaaaga! 
6901 taacattgct agtatcccac ttgaacctoo llaf^^" ttat ?9ttgg aggaagcaga 
"61 gcaagtcaag ttactggtgg aaga«tq?I c^tnt^ caac 5 aaaa g aaaagcttga 
7021 tgaaactgga ggacccgtg? ttgtaaatSe ter?5?S 9 99aattctca aacaattaaa 
7081 tgaaaataag ctcaagcaga cal^ctccf ot^f agC c «? aa gagc aagataaact 
9aaacaagga gaaattgaag ctcaaataaa tn-rtt** 9 9tttccagag ctttacctga 
7201 agaccttgaa gagcagttal atcatctqc? q??^^?? 9 "? ctt 9 aaa aaaagcttga 
7»} ^ aaat " aC aac «accaa accaagalgg Iccltttqan i?^ CtatU 99aalcagtt 
7321 agttcaagct aaacaaccgg atgtaaaaql ? gttcaggaaa ctgaaatagc 

7381 ggaaaaacca gccactcagc cagtqlaolq Si^ tct aaa 999cagc atttgtacia 
7 44! ggcggtaaac cgtttactSc SgS^S SKa^Sg « 9a ? g 99 aa 
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FIGURE 1 (cont.) 

ggagcctctc ctactcagac tgttactctg gtgacacaac ctgtggttac 
7fi?i IttZZ t ?ccatctcca aactagaaat gccatcttcc ttgatgttgg aggtacctgc 
S£f?S" 9at " caacc 999 cttggacaga acttaccgac tggctttctc Cg5ttgat?a 
7741 lltnr^A. tcaca 9 a 599 tgatggtggg tgaccttgag gatatcaacg agatgatcat 
7Rm " a 9 ca ? aa 9 gcaacaatgc aggatttgga acagaggcgt ccccagttgg aigalctcat 
lie* ™?2 ctg ?? caaaat "9 a aaaacaagac cagiaatcaa gaggctagil caltcattac 
llti gaaagaattc agaatcagtg ggatgaagta calgaaclcc ttcagaaccg 

7M1 "^aatgaaa tgttaaagga ttcaacacaa tggctggaag ctaaggaagf 

fln!J ffSS!"? ^tcttaggac aggccagagc caagcttgag tcatggaagg agggtlccta 
ft?n! ^f a ? tagat acaatccaaa agaaaatcac agaaacclag cagttggccl allacctccq 
111] fS?^?"? acaaatgtag atgtggcaaa tgacttggcc ctgaalcttc tccgggatu 
B221 Ixnr^t^l 9 a taccagaa aagtccacat gataacagag aatatcaatg cctcttggag 
Hi]- aag f a "? at aaaa 99gtga gtgagcgaga ggctgctttg gaagaaactc atagattlct 
83U toccaaS rt^l^ fc ?9 aaaa 9tt tcttgcctgg cttacagaag ctglaacaac 
8401 c £ aca 99atg. ctacccgtaa ggaaaggctc ctagaagact ccaaqggagt 

8461 ttltZJm a * 9aaacaat ggwagacct ccaaggtgaa attgaagctc acaclgltgt 
8521 £™?r£« ^H at9aaa aca 9«aaaa aatcctgaga tccctggaag gttccgatga 
fiSfil ll£t~?%l ttacaaagac gtttggataa catgaacttc aagtggagtg aacttcggaa 
BfiSl ?1 CC aaca ttaggt cccatttgga agccagttct gacclgtgga agcgtctgca 
8701 anrlrr^lt ca *J aac "c tggtgtggct acagctgaaa gatgatgaat tlagccggca 
8701 ggcacctatt ggaggcgact ttccagcagt tcagaagcag lacgatgtac ataggacctt 
8821 ItTrfttr. ttgaaaa <*? aagaacctgt aatcatgagt actrttgaga ctgtl?gaat 
III] atfc t<=tgaca gagcagcctt tggaaggact agagaaactc taccaggaoc cclgaqlact 
8881 gcctcctgag gagagagccc agaatgtcac tcggcttcta cgaaagcagg ctolgqaoat 
9001 tgaacctgca ctclgctgac tggcagagla „2tagat£ 

9§61 coaacctaac atalt^J, a % ctt « a 9 a 99ccacggat gagctgglcc tcaagctglg 
9121 ccaaaatlfr ? atcct 9 gca 9«cgtgggc gatctcctca ttgactctct 

9181 r^fJ^^ =5 c 9 a 9 aaa 9 tcaaggcact tcgaggagaa attgcgcctc tgaaagagaa 
9241 «fJ,i~1 C 9tcaatgacc ttgctcgcca gcttaccact ttgggcattc a6ctctcicc 
9241 gtataacctc agcactctgg aagacctgaa caccagatgg aagcttctqc aootoaccat 
936 tlllltllll tTrttT 0 t9ca tgalgc ccacaggga? tttggtcclg cl^agla 
9421 ctactat^r «H^S! CC a 999tccctg ggagagagcc atctcgccaa acaaagt^cc 
9481 ctac^^r a !f= a = qa 9 a °tcaaacaac ttgctgggac catcccaaaa tgacaqagct 
9541 act^M« ^o^ gaCC ^"tfatgt cagattctca gcttatagga ctgccltgaa 
ofim a °:^? aaga ct 9 ca 9 aa gg ccctttgctt ggatctcttg agcctgtcag ctqcatotoa 
•9661 tlatta?? a c SK???''? 1 " 9 "" 3 tSaccagcc? a?gga?atc2 t^gHtl? 
9721 cMtSrtS "■fSlS!^ at9ac< =9 cct ggagcaagag cacaacaatt tggtcaacgt 
f™„ tgc S^atatgt gtctgaactg gctgctgaat gtttatgata caooacoaac 
9781 agggaggatc cgtgtcctgt cttttaaaac tggcatcatt tccctotata alccarf??? 
llol SEESS"? 9 taca f atacc ttttcaagca agtggcaigt tcaacfjga? ??tg?gacca 
llll " cctccttc tgcatgattc tltccaaatt ccaagaclgt tgg?tg«gt 

inn^ !?c atccttt gggggcagta acattgagcc aagtgtccgg agctqcttcc aitttoctfa 

10081 catgqtotqo Itocern 33 ? I llllll^l cct f9 a ctgg atgagactgg aaccccaqtc 
10141 caaaXatllr. ^tSiT—?! -cgcacagagt ggctgctgca gaaactgcca agcatcaaqc 

10201 1 S"Stm vtctltVall lltltlTcll C ttttl 9 t ttC , aggt f«" a 9?ctaaaIL C 

lSii tttlllctll IgSJS ttlStlSi aaccaaaa 99 tat??t^=ga 9 ajlatc^g 
10441 tctoltrssr f9 ac tgtctt agagggggac aacatggaaa ctcccgttac 

10561 tStMt"? ttacllattl ISttFt* 0 ? cct 9<=ctcg tcccctcagc tttcalacga 

10561 t|ga a ctt«.ctaaltaatl SSSZF" t aqc *™ ct * S^agaaatgg aaaacagcaa 
,yy a, -*- tta ' -ccaaacgata gcatctctcc taatgagagc ataqatqatq aacaetfnt-t- 

068 a |ccca^a" tlttlttSl ttl^T^ W.e?cLIc ctg^gcfag? cfcgt'agfcc 
10741 aoatctto^n „;f«^f tagagagtga ggaaagaggg gagctagaga gaatcctagc 
10801 cgaacat! 3 ! ZnJ-ttt?* " aatct 9 ca agcagaatat gawgtctaa Igcagcagla 
10861 glg a ccccqq lltactlTc S£KtT C ccctcct 9 aa atgatgccca cltctcclca 
10921 ictaaaaorr %nal*ZlJ 5 ttgCtga Wccaagcta ctgcgtcaac acaaaggccg 
10 i 1 «gll a a^g c^ct^qq aqca^c^ 3 CCaCaataaa cagctggagt cacag??acl 
11041 qtcctctclt fr?,rS;?5 agcaacccca ggcagaggcc aaagtgaatg gcacaacggt 
11101 ggttqqcaa? caa^ctt^S '^ agaggtC c 9 aca 9 ca gt cagcctatgc tgctccglgt 
11161 aicaloeala 55 actccatggg tgaggaagat cttctcagtc ctccccagga 

1122 aqgttlti™ I^cctaoII lllllltl^ gcaactcaac aa <=tccttcc ctagttcaag 

11281 a^gcfgaS SSSS ^Itl it&tl ggSSS 



FIGURE 1 (COflt) 



11341 ggagcagaat aaatgtttta caactcctga 
11401 acaaagagga ttagacagta agagtttaca 
11461 gtggtattat actgtagatt tcagtagttt 
11521 caggttttac acgtctatgc aattgtacaa 
11581 ttgatagcta aataacttgc catttcttta 
11641 ttataacagt tataaagaaa gattgtaaac 
11701 ataaaaaccc ctaaaaacaa aacaaacaca 
11761 actttgaggc agcgcattgt tttgcatcct 
11821 ctttttcttt ttttgcatat taaagataag 
11881 tacacactgc tcatttgaga actgtcagct 
11941 atatctatat gtctataagt atataaatac 
12001 tctatagact gactttttcc attttttaaa 
12061 tacttctagt cagtcatcca ggcttacctg 
12121 ggaagccagg aggaaactac accacactaa 
12181 ttttaaacaa ctttccactg acaacgaaag 
12241 aacatgtgaa tgaatacaca ggacttatta 
12301 attgattgat tgattgatac attcagcttc 
12361 aatgatgctt cagtggaaat caatcagaag 
12421 ttttttaact cccaagcagt agcaggacga 
12481 cccatccctg tgaaggagta ggccactctt 
12541 acataaagtt ctctgtaatt acaactaaat 
12601 actgggtggt ttggtttttg ttgctttttt 
12661 taaatgccac aagacataat ttaaaataaa 
12721 ccccatcaca tttgtgatac tgacaggtat 
12781 tcctttgcat ttctctgcga gtagttccac 
12841 aaattgattc aaatgttaca aaaaaaccct 
12901 aacaaacaaa caaaaattgc tcaaaaaaga 
12961 ctggtaggaa aaagctttac" tctttcatgc 
13021 tcattcaata gataccaccg tgtgacctat 
13081 agtgtaatta gcttttggag agtgggctga 
13141 gttttgtcca ttattaataa ttaattaatt 
13201 tacctcactt tggttttggg gtgttcctga 
13261 ccacttgtcc attgcgttat tttctttttc 
13321 tttcaaaaga aaacccaaag ctctaaggta 
13381 ttttgtcttg catttttttc ctttatgtga 
13441 taaaactcag atttaaaaca aggggttact 
13501 agtttcattc taaaatcaga ggtaaataga 
13561 gtttttcttt tagacacatt- agctctggag 
13621 gagagcttta ttgctgcatt ttaagcataa 
13681 tttataacca ccgagtatta aactgtaaat 
13741 ggcatgtttt gtcattgttt tcaggtactg 
13801 gttttaacac caacactgta acatttacga 
13861 ttttcacaac atatcagact tqaccaaata 
13921 ttactgtgta tctcaataaa gcacgcagtt 



ttcccgcatg gtttttataa tattcataca 
agaaataaat ctatattttt gtgaagggta 
ctaagtctgt tattgttttg ttaacaatgg 
aaaagttata agaaaactac atgtaaaatc 
tatggaacgc attttgggtt gtttaaaaat 
taaagtgtgc tttataaaaa aaagttgttt 
cacacacaca catacacaca cacacacaaa 
tttggcgtga tatccatatg aaattcatgg 
acttcctcta ccaccacacc aaatgactac 
gagtggggca ggcttgagtt ttcatttcat 
tatagttata tagataaaga gatacgaatt 
tgttcatgtc. acatcctaat agaaagaaat 
cttggtctag aatggatttt tcccggagcc 
aacattgtct acagctccag atgtttctca 
taaagtaaag tattggattt ttttaaaggg 
tatcagagtg agtaatcggt tggttggttg 
ctgctgctag caatgccacg atttagattt 
gtattctgac cttgtgaaca tcagaaggta 
tgatagggct ggagggctat ggattcccag 
taagtgaagg attggatgat. tgttcataat 
tattatgccc tcttctcaca gtcaaaagga 
agatttattg tcccatgtgg gatgagtttt 
taaactttgg gaaaaggtgt aagacagtag 
caacccagaa gcccatgaac tgtgtttcca 
acaggtttgt aagtaagtaa gaaagaaggc 
tcttggtgga ttagacaggt taaatatata 
ggagaaaagc tcaagaggaa aagctaagga 
cattttattt ctttttgatt tttaaatcat 
aattttgcaa atctgttacc tctgacatca 
catcaagtgt aattagcttt tggagagtgg 
aacatcaaac acggcttctc atgctatttc 
taattgtgca cacctgagtt cacagcttca 
ctttataatt ctttcttttt' ccttcataat 
acaaattacc aaattacatg aagatttggt 
cgctggacct tttctttacc caaggatttt 
ttacatccta ctaagaagtt taagtaagta 
gtgcataaat aattttgttt taatcttttt 
tgagtctgtc ataatatttg aacaaaaatt 
ttaatttgga cattatttcg tgttgtgttc 
cataatgtaa ctgaagcata aacatcacat 
agttcttact tgagtatcat aatatattgt 
attatttttt taaacttcag ttttactgca 
tatgccttac tattgtatta tagtactgct 
atgttac 



FIGURE 2 (Mouse Dystrophin cDNA, Acc. No. M68859, SEQ ID N0:2) 



cctcactcac ttgcccctta caggactcag ctcttgaagg caatagcttt atagaaaaaa 
cgaataggaa gacttgaagt gctatttttt tttttttttt tgtcaaggct gctgaagttt 
attggcttct catcgtacct aagcctcctg gagcaataaa actgggagaa acttttacca 
agatttttat ccctgccttg atatatactt tttcttccaa atgctttggt gggaagaagt 
agaggactgt tatgaaagag aagatgttca aaagaaaaca ttcacaaaat ggataaatgc 
acaattttct aagtttggaa agcaacacat agacaacctc ttcagtgacc tgcaggatgg 
aaaacgcctc ctagacctct tggaaggcct tacagggcaa aaactgccaa aagaaaaggg 
atctacaaga gttcatgccc tgaacaatgt caacaaggca ctgcgggtct tacagaaaaa 
taatgttgat ttagtgaata taggaagcac tgacatagtg gatggaaatc ataaactcac 
tcttggtttg atttggaata taatcctcca ctggcaggtc aaaaatgtga tgaaaactat 
catggctgga ttgcagcaaa ccaacagtga aaagattctt ctgagctggg ttcgacagtc 
aacacgtaat tatccacagg ttaacgtcat caacttcacc tctagctggt ccgacgggtt 
ggctttgaat gctcttatcc atagtcacag gcccgacctg tttgattgga atagtgtggt 
ttcacagcac tcagccaccc aaagactgga acatgccttc aacattgcaa aatgccagtt 
aggcatagaa aaacttcttg atcctgaaga tgttgctacc acttatccag acaagaagtc 
catcttaatg tacatcacat cactctttca agttttgcca caacaagtga gcattgaagc 
cattcaagaa gtggaaatgt tgcccaggac atcttcaaaa gtaactagag aagaacattt 
tcaattacat caccagatgc attactctca acagatcaca gtcagtctag cacagggcta 
tgaacaaact tcttcatctc ctaagcctcg attcaagagt ..tatgccttca cacaggctgc. 
ttatgttgcc acctctgatt ccacacagag cccctatcct tcacagcatt tggaagctcc 
cagagacaag tcacttgaca gttcattgat ggagacggaa gtaaatctgg atagttacca 
aactgcttta gaagaagtac tttcatggct tctttctgcc gaggatacat tgcgagcaca 
aggagagatt tcaaatgatg ttgaagaagt gaaagaacag tttcatgctc atgagggatt 
catgatggat ctgacatctc atcaaggact tgttggtaat gttctacag.t taggaagtca 
actagttgga aaagggaaat tatcagaaga tgaagaagct gaagtgcaag aacaaatgaa 
tctcctaaat tcaagatggg aatgtctcag ggtagctagc atggaaaaac aaagcaaatt. 
acacaaagtt ctaatggatc tccagaatca gaaattaaaa gaactagatg actggttaac 
aaaaactgaa gagagaacta agaaaatgga ggaagagccc tttggacctg atcttgaaga 
tctaaaatgc caagtacaac aacataaggt gcttcaagaa gatctagaac aggagcaggt 
cagggtcaac tcgctcactc acatggtagt agtggttgat gaatccagcg gtgatcatgc 
aacagctgct ttggaagaac aacttaaggt actgggagat cgatgggcaa atatctgcag 
atggactgaa gaccgctgga ttgttttaca agatattctt ctaaaatggc agcattttac 
tgaagaacag tgccttttta gtacatggct ttcagaaaaa gaagatgcaa tgaagaacat 
tcagacaagt ggctttaaag atcaaaatga aatgatgtca agtcttcaca aaatatctac 
tttaaaaata gatctagaaa agaaaaagcc aaccatggaa aaactaagtt cactcaatca 
agatctactt tcggcactga aaaataagtc agtgactcaa aagatggaaa tctggatgga 
aaactttgca caacgttggg. acaatttaac ccaaaaactt gaaaagagtt cagcacaaat 
ttcacaggct gtcaccacca ctcaaccatc cctaacacag acaactgtaa tggaaacggt 
aactatggtg accacaaggg aacaaatcat ggtaaaacat gcccaagagg aacttccacc 
accacctcct caaaagaaga ggcagataac tgtggattct gaactcagga aaaggttgga 
tgtcgatata actgaacttc acagttggat tactcgttca gaagctgtat tacagagttc 
tgaatttgca gtctatcgaa aagaaggcaa catctcagac ttgcaagaaa aagtcaatgc 
catagcacga gaaaaagcag agaagttcag aaaactgcaa gatgccagca gatcagctca 
ggccctggtg gaacagatgg caaatgaggg tgttaatgct gaaagtatca gacaagcttc 
agaacaactg aacagccggt ggacagaatt ctgccaattg ctgagtgaga gagttaactg 
gctagagtat caaaccaaca tcattacctt ttataatcag ctacaacaat tggaacagat 
gacaactact gccgaaaact tgttgaaaac ccagtctacc accctatcag agccaacagc 
aattaaaagc eagttaaaaa tttgtaagga tgaagtcaac agattgtcag ctcttcagcc 
tcaaattgag caattaaaaa ttcagagtct acaactgaaa gaaaagggac aggggccaat 
gtttctggat gcagactttg tggcctttac taatcatttt aaccacatct ttgatggtgt 
gagggccaaa gagaaagagc tacagacaat ttttgacact ttaccaccaa tgcgctatca 
ggagacaatg agtagcatca ggacgtggat ccagcagtca gaaagcaaac tctctgtacc 
ttatcttagt gttactgaat atgaaataat ggaggagaga ctcgggaaat tacaggctct 
gcaaagttct ttgaaagagc aacaaaatgg cttcaactat ctgagtgaca ctgtgaagga 
gatggccaag aaagcacctt cagaaatatg ccagaaatat ctgtcagaat ttgaagagat 
tgaggggcac tggaagaaac tttcctccca gttggtggaa agctgccaaa agctagaaga . 
acatatgaat aaacttcgaa aatttcagaa tcacataaaa accttacaga aatggatggc 
tgaagttgat gttttcctga aagaggaatg gcctgccctg ggggatgctg aaatcctgaa 
aaaacagctc aaacaatgca gacttttagt tggtgatatt caaacaattc a-gcccagttt 
aaatactgtt aatgaaggtg ggcagaagat aaagagtgaa gctgaacttg agtttgcatc 
cagactggag acagaactta gagagctcaa cactcagtgg gatcacatat gccgccaggt 



FIGURE 2 (cont) 

3661 ctacaccaga aaggaagcct taaaggcagg 
3721 tctatcagag atgcatgagt ggatgacaca 
3781 tgaatataaa actccagatg aattacagac 
3841 agaggcacta. caaaaagaaa ctaaagtgaa 
3901 agctcacgct ccaccctcag cacaagaggc 
3961 caactaccaa tggctgtgca ccaggctgaa 
4021 ggcatgttgg catgagttat tgtcatattt 
4081 agaattgaaa cttaaaacca tggaaaatgt 
4141 gctagaatct cttgaaaatc tgatgcatca 
4201 attggcacag actcttacag atggaggagt 
4261 gacgtttaat tctcgttgga gggaactaca 
4 321 tgaacagagt atccagtctg cccaggaaat 
4381 gcttgaattc attgacaagc agttggcagc 
4441 aatgcctcag gaagcccaga aaatccaatc 
4 501 agaaatgaag aaacataacc aggggaagga 
4561 tgttgcacag aaaaaattac aagatgtctc 
4 621 caattttgaa caacgtctag aggaaagtaa 
4 681 gcctgcattg gaaaccaaga gtgttgaaca 
4741 tgtgaacttg tataaaagcc tgagtgaagt 
4801 cggacgtcaa attgtacaga . aaaagcagac 
4861 aacagctttg aaattgcatt acaatgagtt 
4921 gttggagaaa tgcttgaagt tgtcccgtaa 
4981 atggctggca gcaacagata cagaattgac 
5041 taatttggat tctgaagttg cctggggaaa 
5101 ggctcacttg aagagtgtta cagaattagg 
5161 agaaaccttg gtagaagata aactgagtct 
5221 cagagtagaa gaatggctaa atcttttgtt 
5281 tcagaacata gaacaaatca caaagtggat 
5341 tgaaaagaag aaaccacaac aaaaggaaga 
5401 tgacatgcgc ccaaaggtgg actccacacg 
5461 cggtgaccac tgcaggaaag tagtagagcc 
5521 agctatttct cacagaatta agactggaaa 
5581 gtttaactca gatatacaaa aattgcttga 
5641 gaatctgaaa gaggaagact tcaataaaga 
5701 tgaattgttg caaagaggag acaacttaca 
5761 ggaaataaag ataaaacagc agctgttaca 
5821 gtctcaaaga agaaaaaagg ccctagaaat 
5881 ggctgatgat ctcctgaaat gcttggatga 
5941 acccagagat gaaagaaaat taaaggaaat 
6001 gctgaatgca gtgcgcaggc aagctgaggg 
6061 ggagccaact cagatccagc tcagcaagcg 
6121 gtttcgaaga ctcaactttg cacaaattca 
6181 gactgaagat atgcctttgg atgtttctta 
6241 tcatatctta caagctcttt cagaagttga 
6301 taaagatttt gaagatcttt ttaagcaaga 
6361 gcaacaaatc tcaggtcgga ttgatattat 
6421 tgccacctcc atggaaaagg tgaaagtaca 
6481 ggaaaaactt catagaatgt acaaggaacg 
6541 atggcgacac tttcattatg atatgaaggt 
6601 gtttttcaaa aagacacaaa atcctgaaaa 
6661 taaggaactc caggatggca ttgggcagcg 
6721 tggggaagaa ataattcaac agtcttcaaa 
6781 aggaagcttg- agtctgcggt ggcacgacat 
6841 gattgaagaa caaaagaatg tcttgtcaga 
6901 gtggctggaa gaagcagata acattgctat 
6961 agaacaactt gaacaagtca agttactggc 
7021 aaaacaatta aatgaaacag gaggagcagt 
7081 gcaagataaa cttgaaaaga agctcaaaca 
7141 agctttacct gagaaacaag gagagcttga 
7201 agagcagctg gatcacctgc ttctgtggct 
7261 taaccaacca agtcaggcag gaccgtttga 
7321 taaacaagcg gatgtggaaa ggcttttgtc 
7381 aagcactcag ccagtgaaga ggaagttaga 
7441 ccatttactt cgggagctga ggacaaagca 



tttggataaa accgtaagcc tccaaaaaga 
agctgaagaa gaatatctag agagagattt 
tgctgttgaa gaaatgaaga gagctaaaga 
actccttact gagactgtaa atagtgtaat - 
cttaaaaaag gaacttgaaa ctctgaccac 
tggaaaatgc aaaactttgg aagaagtttg 
agagaaagca aacaagtggc tcaatgaagt 
tcctgcagga cctgaggaaa tcactgaagt 
ttcagaggag aacccaaatc agattcgtct 
catggatgaa ctgatcaatg aggagcttga 
tgaagaggct gtgaggaaac aaaagttgct 
tgaaaagtcc ttgcacttaa ttcaggagtc 
ttatatcact gacaaggtgg atgcagctca 
agatttgaca -agtcatgaga taagtttaga 
tgccaaccaa agggttcttt cacaaattga 
catgaaattt cgattattcc aaaaaccagc 
gatgatttta gatgaagtca agatgcattt 
ggaagtaatt cagtcacaac taagtcattg 
caagtctgaa gtggaaatgg tgattaaaac 
agaaaatccc aaagagcttg .atgaacgagt 
gggtgcgaag gtaacagaga gaaagcaaca 
gatgagaaag gaaatgaatg tcttaacaga 
gaagagatca gcagttgaag gaatgccaag 
ggctactcaa. aaagagattg agaaa'cagaa 
agagtctttg aaaatggtgt tgggcaagaa 
tctgaacagt aactggatag ctgtcacctc 
ggaataccag aaacacatgg aaacctttga 
cattcatgca gatgaacttt tagatgagtc 
cattcttaag cgtttaaagg ctgaaatgaa 
tgaccaagca gcaaaattga tggcaaaccg 
ccaaatctct gagctcaacc gtcgatttgc 
ggcctccatt cctttgaagg aattggagca 
accactggag gctgaaattc agcagggggt 
tatgagtgaa gacaatgagg gtactgtaaa 
acaaagaatc acagatgaga gaaagcgaga 
gacaaaacat aatgctctca aggatttgag 
ttctcaccag tggtatcagt acaagaggca 
aattgaaaaa aaattagcca gcctacctga 
tgatcgtgaa ttgcagaaga agaaagagga 
cttgtctgag aatggggccg caatggcagt 
ctggcggcaa attgagagca attttgctca 
cactctccat gaagaaacta tggtagtgac 
tgtgccttct acttatttga ccgagatcag 
tcatcttcta aatactcctg aactctgtgc 
ggagtctctt aagaatataa aagacaattt 
tcacaagaag aagacagcag ccttgcaaag 
ggaagccgtg gcacagatgg atttccaggg 
acaagggcga ttcgacagat cagttgaaaa 
atttaatcaa tggctgaatg aagttgaaca 
ctgggaacat gctaaataca aatggtatct 
tcaagctgtt gtcagaacac tgaatgcaac 
aacagatgtc aatattctac aagaaaaatt 
ctgcaaagag ctggcagaaa ggagaaagag 
atttcaaaga gatttaaatg aatttgtttt 
tactccactt ggagatgagc agcagctaaa 
agaagagttg cccctgcgcc agggaattct 
acttgtaagt gctcccataa .ggccagaaga 
gacaaatctc cagtggataa aggtctccag 
ggttcactta aaagatttta ggcagcttga 
ctctcctatt agaaaccagt tggaaattta 
cataaaggag attgaagtaa cagttcacgg 
gaaagggcag catttgtata aggaaaaacc 
agatctgagg tctgagtggg aggctgtaaa 

gcctgaccgt gcccctggac tgagcactac 



FIGURE 2 (cont.) 
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tggagcctct 
tgtcatctcc 
cttcaaccga 
atcacagaga 
ggcaacactg 
ccagaatttg 
tgaaagaatt 
gttgaatgaa 
ggtcatagga 
tgcaatccaa 
gataagtgta 
tgataccaga 
taaaagagta 
ccctctggac 
cctacaggac 
gatgaaacca 
tcttgatgaa 
gttacaaaga 
caacattagg 
tcaggaactt 
cggtggtgat 
attgaaaact 
agagcagcct 
agaaagagct 
atgggacaaa 
aagactccag 
ggtgatcaag 
ccttgaaaaa 
tgtcaatgac 
cagcactttg 
tgtcagacag 
cacttcagtt 
caaccacgag 
tttagctgac 
gctccagaag 
ccagcacaac 
gactacaatt 
tgtggatatg 
ccgtgtcctg 
gtacagatac 
gggtcttctt 
tgggggcagt 
tgagattgaa 
gctgcccgtc 
catctgtaag 
tgacatctgc 
ccccatggta 
ggtactaaaa 
cctgccagtg 
cttctggcca 
ttcacgcatt 
tctaaatgat 
ttactgccaa 
cttgatttcc 
ggaagaaaac 
aggcctgtct 
gg.atgctgag 
caggatgcaa 
acagctcctg 
ttccacctct 
tcaaacttca 
agggttagaa 
tgcccccgga 
gatttgggca 



gccagtcaga 
aaactagaaa 
gcttggacag 
gtgatggtgg 
caagatttgg 
aaaaacaaaa 
cagattcagt 
atgttaaagg 
caggtcagag 
aagaagatca 
gacgtggcaa 
aa'agtacaca 
agt'gagcaag 
ctggagaagt 
gcttcccgta 
tggcaagatc 
aatggccaaa 
cgtttggata 
tcccatttgg 
cttgtttggc 
ttcccagcag 
aaagaacctg 
ttggaaggac 
cagaatgtca 
ttgaacctgc 
gaacttcagg 
ggatcctggc 
gtcaaggcac 
cttgcacatc 
gaagatctga 
ctgcatgaag 
cagggtccc't 
acccaaacca 
ctgaataatg 
gccctttgct 
ctcaagcaaa 
tatgatcgtc 
tgtctca'act 
tcttttaaaa 
cttttcaagc 
ctgcatgatt 
aacattgagc 
gctgctctct 
ttgcacagag 
gagtgtccaa 
caaagttgct 
gagtattgca 
aacaaatttc 
cagactgtgt 
gtagattctg 
gaacattatg 
agcatctctc 
agtttgaacc 
ttagagagtg 
aggaatctgc 
ccactgccat 
ctcattgctg 
a'tcctggaag 
gagcagcccc 
ctgcagaggt 
gaatctatgg 
gaagtgatgg 
aagccaatga 
gagcgatgga 



ctgttactct 
tgccatcttc 
aacttacaga 
gtgatctgga 
aacagagacg 
ccagcaatca 
gggatgaggt 
attcaacaca 
gcaagcttga 
cagaaaccaa 
atgatttggc 
tgataacaga 
aggctgcttt 
ttctttcctg 
aggagaagct 
tccaaggaga 
aaatcctgag 
acatgaattt 
aagcaagttc 
tacagctgaa 
ttcagaagca 
taatcatgag 
tagagaaact 
ctcggctcct 
gctcagctga 
aagctgccga 
agccagtggg 
ttcggggaga 
agctgaccac 
ataccagatg 
cccacaggga 
gggagagagc 
cttgttggga 
tcaggttctc 
tggatctctt 
atgaccagcc 
tggagcaaga 
ggcttctcaa 
ctggcatcat 
aagtggcaag 
ctattcaaat 
cgagtgtcag 
tccttgactg 
tggctgctgc 
tcattggatt 
ttttttctgg 
ctccgactac 
gaaccaaaag 
tagaggggga 
cgcctgcctc 
ctagcaggct 
ctaatgagag 
aggactcccc 
aggaaagagg 
aagcagaata 
ctcctcctga 
aggctaagct 
accacaataa 
aggctgaagc 
cagatagcag 
gtgaggaaga 
agcaactcaa 
gagaggacac 
gtccttagtt 



agtgacacaa 
tttgctgttg 
ctggctgtct 
agacatcaat 
cccccaattg 
agaagctaga 
tcaagaacag 
atggctggaa 
ctcatggaaa 
gcagttggcc 
actgaaactt 
gaatatcaat 
ggaagaaact 
gattacggaa 
cctagaagac 
aattgaaact 
atccctggaa 
caagtggagt 
tgaccagtgg 
agatgatgaa 
gaatgatata 
tactctggag 
ctaccaggag 
acgaaagcag 
ttggcagaga 
tgaactggac 
ggatctcctc 
aattgcacct 
actgggcatt 
gaggcttcta 
ctttggtcct 
catctcacca 
ccaccccaaa 
cgcgtatagg 
gagcctgtca 
catggatatc 
gcacaacaat 
tgtttatgat 
ttctctgtgt 
ttcaactggc 
cccaagacag 
gagctgcttc 
gatgcgcctg 
tgaaactgcc 
caggtacaga 
ccgagttgca 
atccggagaa 
gtattttgcg 
caacatggaa 
gtccccccag 
agcagaaatg 
catagatgat 
cctgagccag 
ggagctagag 
tgatcgcctg 
gatgatgccc 
actgcgccaa 
acagctggag 
taaggtgaat 
tcagcctatg 
tcttctgagt 
caactccttc 
aatgtaggaa 
tcagtcatga 



tctgtggtta 
gaggtacctg 
ctgcttgatc 
gaaatgatca 
gaagaactca 
acaatcatta 
ctgcagaaca 
gctaaggaag 
gaaggtcctc 
aaagacctcc 
cttcgggact 
acttcttggg 
catagattac 
gcagaaacaa 
tccaggggag 
cacacagata 
ggttcggatg 
gaacttcaga 
aagcgtttgc 
ctgagccgtc 
catagggcct 
actgtgagaa 
cccagagaac 
gctgaagagg 
aaaatagatg 
ctcaagttgc 
attgactctc 
cttaaagaga 
cagctctcac 
caggtggctg 
gcatcccagc 
aacaaagtgc 
atgacagagc 
actgccatga 
gctgcatgtg 
ctgcagataa 
ctggtcaatg 
acgggacgaa 
aaagcacact 
ttttgtgacc 
ttgggtgaag 
caatttgcca 
gaaccccagt 
aagcatcaag 
agcctaaagc 
aagggccata 
gatgttcgcg 
aagcatcccc 
actcccgtta 
ctttcacacg 
gaaaacagca 
gaacatttgt 
cctcgtagtc 
.agaatcctag 
aagcagcagc 
acctctcctc 
cacaaaggac 
tctcagttac 
ggcaccacgg 
ctgctccgag 
cctccccagg 
cctagttcaa 
gccttttcca 
cagatgaaga 



ctaaggaaac 
cactggcaga 
gagttataaa 
tcaaacagaa 
ttactgctgc 
ctgatcgaat 
ggagacaaca 
aagccgaaca 
acacagtaga 
gtcaacggca 
attctgctga 
gaaacattca 
tgcagcagtt 
ctgccaatgt 
tcagagagct 
tctatcacaa 
aagcacccct 
aaaagtctct 
atctttctct 
aggcacccat 
tcaagaggga 
tatttctgac 
tgcctcctga 
tcaacgctga 
aagctcttga 
gccaagctga 
tgcaagatca 
atgtcaatcg 
cttataacct 
tggaggaccg 
acttcctttc 
cctactatat 
tctaccagtc 
agctcagaag 
atgccctgga 
ttaactgttt 
tccctctctg 
cagggaggat 
tggaagacaa 
agcgtaggct 
ttgcttcctt 
ataataaacc 
ctatggtgtg 
ccaagtgtaa 
attttaatta 
aaatgcacta 
acttcgccaa 
gaatgggcta 
ctctgatcaa 
atgatactca 
atggatctta 
taatccagca 
ctgcccagat 
cagatcttga 
atgagcataa 
agagtcccag 
gcctggaagc 
atagactgag 
tgtcctctcc 
tggttggcag 
acacaagcac 
gaggaagaaa 
catggcagat 
aggagcagaa 



FIGURE 2 (COilt) 

11341 taaatgtttt acaactcctg attcccgcat 
11401 attagacagt aagagtttac aagaaataaa 
114 61 atactgtaga tttcagtagt ttctaagtct 
11521 acacgtctat gcaattgtac aaaaaagtta 
11581 ataacttgcc atttctttat atggaacgca 
11641 ataaagagag attgtaaact aaagtgtgct 
11701 aaacaaacac acacgcacac acacacacac 
11761 atgcacgaac ccaccacaca cacacacaca 
11821 gcattacttt a'gcgtggtat tcatatggaa 
11881 gaaccccacc aaatgactgc t.tcatattgc 
11941 ggctatgggc tttcatttta tacatctata 
12001 atagataaat agatatgaag ttacttcttc 
12061 gcttctagtc atctgggctt atctgcttgg 
12121 ccaggagact accgccacac taaaatattg 
12181 ctttccactg agagctagag gattcatttt 
12241 acttactatc atagtaattt gttggctgat 
12301 taatgatgtt tctgcaatag aacatcagat 
12361 agatggtagg agctaaaggt tgcggctcct 
12421 ctgtaaaaga atggatgaat atttacaact 
12481 aaattgttct gccctcttca taaactcaac 
12541 ttgttgttgt tgatgatgat gatgaatttt 
12601 tcattgtgat gatttttttt tttaatgctg 
12661 cccaatcaca ttgtgaccct ggtgaatatc 
12721 cctttgcatt ggtttccctg caagtaactc 
12781 ggttggaaag ttttgagagc aaaagcgtct 
12841 aatgtctaaa caaatgcaag tcattgaacc 
12901 ctgctaggaa gagctttact cctctcatgc 
12961 ttctctcaat agaaatcact gtcctatcat 
13021 gtaattaact tctagcgagt gggttttgtc 
13081 gcttctcatg ctatttct'ac ctcactttgg 
13141 ctaatttcac aacttcacca cttgtctgtt 
13201 taatttccaa aagaaaaccc aaagctctaa 
13261 gcttttctct ctctctctcc tttatgtggc 
13321 tttcaccaag atttaaaaca aggggttcct 
13381 aaaatccaag gtagatagag tgcatagttt 
134 41 tattagttct ggagtgaatc tatcaaaata 
13501 attttaagca taatttggac atcatttcat 
13561 aaatcataat cagtgtaact gaagcataat 
13621 aggtactgga ctcttacttg agtatcataa 
1 13681 atttactaat tattttttta aacttcagtt 
13741 accaaatata tgccttacta ttgtattata 
13801 acgcagttat gttac 



ggtttttata atattcgtac aacaaagagg 
atctatattt ttgtgaaggg tagtggtact 
gttattgttt tgttaacaat ggcaggtttt 
aaagaaaaca tgtaaaatct tgatagctaa 
ttttgggttg tttaaaaatt tataacagtt 
ttataaaaaa agttgtttat aaaaacccct. 
acacacacac acacacacac gcacacatac 
cacacacaca ctgaggcagc acattgtttt 
ttcatgacgt ttttttattt tcttgcatac 
tcttttgaga attgttgact gagtggggct 
tgtctacaag tatataaata ctataggtat 
aaatgttctt gccacttcct aatggaaatt 
gcaagagtga attttccctg gagcccaaag 
tctagggctc cagatgtttc tagttttaaa 
tttcaaggaa catgcgaatg aatacacagg 
atattcaact tcctactgtt gggttatatt 
gacattttta actcccagac agtaggagga 
cagtcaattt. atatgagggg agcaacaact 
atacatataa acatctctat aattacaact 
ctgaagtggg tggttttgtt gttgttgttg 
agattttaga ttttttgggt ttttttttct 
caagacttag gattactgtt aagaaagtaa 
agtccagaag cccatgaact gcatttgtct 
cacacaggat tgtgggtgag aaggcacagt 
ccaaactctc tggtctagtt gacgggctga 
aggagaaaaa gtgcaacaga aagctaagga 
cagtttcttc ttcttagcat ttaaagagca 
tttgcaaatc tgttacctct aacgtcaagt 
cattattaat tgtaattaac atcaaacaca 
ttttggggtg tttctagtaa ttgtgcacac 
gtgtggacac cagtttcctt ttttcattta 
gataacaaat tgaaatttgg ttctggtctt 
actgggcatt ttctttatcc aaggatttgt 
ttcctactaa gaagttttaa gtttcattct 
tgttttaatc ttttcgtttt atcttttaga 
tttgaataaa aactgagagc tttattgctg 
gttctttata accatcaagt attaaagtgt 
catcacatgg catgtatcat cattgtctcc 
tagattgtgt tttaacacca acactgtaac 
ttactgcatt ttcacaacat atcagatttc 
ttactgcttt actgtgtatc tcaataaagc 



FIGURE 3 (Human Utrophin cDNA, Acc. No. X69086, SEQ ID NO: 3) 
1 atggccaagt atggagaaca tgaagccagt cctgacaatg ggcagaac^ attcajtgat 
61 atcattaagt ocagatctga tgaacacaat 9^9"=^ ataatatatt cacagacctc 
121 ataaatgctc gattttcaaa Wtggg|« ^ a "catca caggaacltc actgccaaag 
181 aaagatggaa ggaagctatt ggatcttcta ^aggccuca yy gca ggtttta 
241 gaacgtggtt ccacaagggt acatgcctta aataacgtca ^cagagLgc y ^ 
301 catcagaaca atgtggaatt agitata gggggaactg acattgtgga jg 
361 aaactgactt tggggttact ttggagcatc ^ttttgcact 9 cagctgggtg 

421 aaggatgtca tgtcggacct gcagcagacg aa«gtgaga |g^cctgct cage yyyj 
481 cgtcagacca ccaggcccta cagccaagtc aacgtcctca ^cttcaccac y yy 
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1021 gatgttgaag aagtcaaaga ecagtttgea aawatgaag ««tatgac 

1081 gcacaccaga gcagtgtggg cagcgtcctg c a ggfaggca ««a«g« aa W 
1141 Ictctgtcag acgaagaaga atttgagatt caggaacaga Jgaccctgct 9 
1201 tgggaggctc ttagggtgga gagtatggac fgacagtccc 9|«gca 9 y y y 
1261 gaactgeaga agaagcaact gcagcagctc tccgcctggt . Hagctgcta 

1321 attcagaaga tggaaacttg ccccctggat ^tgatgtaa "tctctaca a g j 
1381 gaagaacata aaagtttgca aagtgatctt gaggctgaac aggcg««3' tatcctagaa 
1441 Ictcacatgg tggtcattgt tgatgaaaac agtggtgaga . a aa 

1501 gaccagttac agaaacttgg tgagcgctgg acagcagtat g«g"ggac ty y y 
1561 ?ggaa?aggt tacaagaaat caatatattg tggcaggaat tattggaaga |cagtgcttg 
1621 ttgaaagctt ggttaaccga aaaagaagag gctttaaata aagtccagac "y^=. 
1681 aalgaccaaa Iggaactaag tgtcagtgtt egaegtctgg ctattttgaa ggaagacatg 
1741 qaaatqaagc gtcaaacatt ggatcagctg agtgagattg gecaggatgt gsgacaatca 
1801 Stgataatt Icaaggcatc taagaagatc aacagtgact cagaggaact Qactcaaaga 
ioci ^inan-rrh taattcaaaa actaqaagat tcctccaacc aggtgactca ggctgtagca 

Ib'i ™ » Illllili 

2041 cagltccatg tggatattga agctaagaaa aagtttgatg ctataagtgc agagctgttg 
aartaoattt tqaaatqqaa aactgecatt cagaccacag agataaaaga gtacaigaag 
6 algcalgaca cftccgHat gaaaaagaag ttgaaggcat tagaaaaag. acagagaga. 
anlAtrrrca naacaaataa attaaaccaa actggacaaa tccttgtgga gcaaatggga 
22 Ilagaaggcc ffl! agaaataaaa aatgttctgg agaaggtttc ■tcaga.tgg 
aaoaatotat ctcaacattt ggaagatcta gaaagaaaga ttcagctaca ggaagacata 
2401 utgettatt tcaagcag" Igatgagctt gaa.aggtca tca.gacaaa 
2461 gtalaacaca cttccatttc tgaatcttcc cggcagtcct tgecaagett gaaggattw 
?V?\ latllccaaa aattqacaaa tcttcttggc cttcacccca aaattgaaat ggctcgtgca 
llll agctgctcgl ccctgatgtc tcagcct?ct gccccagatt ttgtccagcg |ggc«cgat 
2641 aactttctqq gccgctacca agctgtacaa gaggctgtag aggategtea a «acatcta 
2701 glgaatgal? tgalgggcca acctggacat gcatatctgg aaacattgaa aa "^gaaa 
2761 latgtgctaa atgattcaga aaataaggee caggtgtctc tgaatgtcct taatgatctt 
2821 Icclalgtgg agaaggcect gcaagaaaaa aagacccttg atgaaatcct tgagaatcag 
2881 aaacctgeat tacataaact tgcagaagaa acaaaggctc tggagaaaaa «g"oatcct 
2941 gatgtagaaa aattatataa gcaagaattt gatgatgtgc aaggaaagtg gaacaagcta 
3001 aaggtcttgg tttccaaaga tctacatttg cttgaggaaa ttgctctcac actcagagct 
3061 tttgaggccl attcaacagt cattgagaag tggatggatg g=gtg aaa g a ££Sc?ct 
ti5i aaacaacaaa ctacccaaqq agacgacgea ggtctacaga ggcagttaga ccagtgctcc 
nil gcatttgtta atgaaltaga aSeaittgaa tcatctctga aaaacatgaa ggaaatagag 
32 1 Ictaatcttc galgtggt^c agttgctgga ataaaaactt ^g^gac aa ^"??t 
3301 gactaccaaa ctcaactgga gaaacttagc aaggagatcg "actcaaaa "gtaggttg 
3361 tctgaaagtc aagaaaaagc tgcgaacctg aagaaagact tggcagagat p»utj 
3421 atglcccagg ccgaggaaga atatttggag egggattttg agtacaagtc aceagaagag 
3481 cttgagagtg ctgtggaaga gatgaagagg gcaaaagagg a tgtgttgca g aa |g a g9^ 
3541 agaotgaaga ttctcaagga caacatcaag ttattagctg ccaaggtgcc ctctggtggc 
3601 clglagttga egtctgaget gaatgttgtg ctggagaatt accaacttct ttgtaacaga 



Fl^RE 3 (cont) 

3661 attcgaggaa agtgccacac gctagaggag gtctggtctt gttggattga actgcttcac 
3*721 tatttggatc ttgaaactac ctggttaaac actttggaag agcggatgaa gagcacagag 
3*781 gtcctgcctg agaagacgga tgctgtcaac gaagccctgg agtctctgga atctgttctg 
3841 cgccacccgg cagataatcg cacccagatt cgagagcttg gccaga.ctct gattgatggg 
3901 gggatcctgg atgatataat cagtgagaaa ctggaggctt tcaacagqcg atatgaagat 
3961 ctaagtcacc tggcagagag caagcagatt tctttggaaa agcaactcca ggtgctgcgg 
4021 gaaactgacc agatgcttca agtcttgcaa gagagcttgg gggagctgga caaacagctc 
4081 accacatacc tgactgacag gatagatgct ttccaagttc cacaggaagc tcagaaaatc 
4141 caagcagaga tctcagccca tgagctaacc ctagaggagt tgagaagaaa tatgcgttct 
4201 cagcccctga cctccccaga gagtaggact gccagaggag gaagtcagat ggatgtgcta 
4261 cagaggaaac tccgagaggt gtccacaaag ttccagcttt tccagaagcc agctaacttc 
4321 gagcagcgca tgctggactg caagcgtgtg ctggatggcg tgaaagcaga acttcacgct 
4381 ctggatgtga aggacgtaga ccctgacgtc atacagacgc acctggacaa gtgtatgaaa 
4441 ctgtataaaa ctttgagtga agtcaaactt gaagtggaaa ctgtgattaa aacaggaaga 
4 501 catattgtcc agaaacagca aacggacaac ccaaaaggga tggatgagca gctgacttcc 
4 561 ctgaaggttc tttacaatga cctgggcgca caggtgacag aaggaaaaca ggatctggaa 
4 621 agagcatcac agttggcccg gaaaatgaag aaagaggctg cttctctctc tgaatggctt 
4 681 tctgctactg aaactgaatt ggtacagaag tccacttcag aaggtctgct tggtgacttg 
4 741 gatacagaaa tttcctgggc taaaaatgtt ctgaaggatc tggaaaagag aaaagctgat 
4801 ttaaatacca tcacagagag tagtgctgcc ctgcaaaact tgattgaggg cagtgagcct 
4861 attttagaag agaggctctg cgtccttaac gctgggtgga gccgagttcg tacctggact 
4921 gaagattggt gcaatacctt gatgaaccat cagaaccagc tagaaatatt tgatgggaac 
4 981 gtggctcaca taagtacctg gctttatcaa gctgaagctc tattggatga aattgaaaag- 
5041 aaaccaacaa gtaaacagga agaaattgtg aagcgtttag tatctgagct gga-tgatgcc 
5101 aacctccagg ttgaaaatgt ccgcgatcaa gcccttattt tgatgaatgc ccgtggaagc 
5161 tcaagcaggg agcttgtaga accaaagtta gctgagctga ataggaactt tgaaaaggtg 
5221 tctcaacata tcaaaagtgc caaattgcta attgctcagg aaccattata ccaatgtttg 
5281 gtcaccactg aaacatttga aactggtgtg cctttctctg acttggaaaa attagaaaat 
5341 gacatagaaa atatgttaaa atttgtggaa aaacacttgg aatccagtga tgaagatgaa 
5401 aagatggatg aggagagtgc ccagattgag gaagttctac aaagaggaga agaaatgtta 
5461 catcaaccta tggaagataa taaaaaagaa' aagatccgtt tgcaattatt acttttgcat 
5521 actagataca acaaaattaa ggcaatccct attcaacaga ggaaaatggg tcaacttgct 
5581 tctggaatta gatcatcact tcttcctaca gattatctgg ttgaaattaa caaaatttta 
5641 ctttgcatgg atgatgttga attatcgctt aatgttccag agctcaacac tgctatttac 
5701 gaagacttct cttttcagga agactctctg aagaatatca aagaccaact ggacaaactt 
5761 ggagagcaga ttgcagtcat tcatgaaaaa cagccagatg tcatccttga agcctctgga 
5821 cctgaagcca ttcagatcag agatacactt actcagctga atgcaaaatg ggacagaatt 
5881 aatagaatgt acagtgatcg gaaaggttgt tttgacaggg caatggaaga atggagacag 
5941 ttccattgtg accttaatga cctcacacag tggataacag aggctgaaga attactggtt 
6001 gatacctgtg ctccaggtgg cagcctggac ttagagaaag ccaggataca tcagcaggaa 
6061 cttgaggtgg gcatcagcag ccaccagccc agttttgcag cactaaaccg aactggggat 
6121 gggattgtgc agaaactctc ccaggcagat ggaagcttct tgaaagaaaa actggcaggt 
6181 ttaaaccaac gctgggatgc aattgttgca gaagtgaagg ataggcagcc aaggctaaaa • 
6241 ggagaaagta agcaggtgat gaagtacagg catcagctag atgagattat ctgttggtta 
6301 acaaaggctg agcatgctat gcaaaagaga tcaaccaccg aattgggaga aaacctgcaa 
6361 gaattaagag acttaactca agaaatggaa gtacatgctg aaaaactcaa atggctgaa.t 
6421 agaactgaat tggagatgct ttcagataaa agtctgagtt tacctgaaag ggataaaatt 
6481 tcagaaagct taaggactgt aaatatgaca tggaataaga tttgcagaga ggtgcctacc 
6541 accctgaagg aatgcatcca ggagcccagt tctgtttcac. agacaaggat tgctgctcat 
6601 cctaatgtcc aaaaggtggt gctagtatca tctgcgtcag atattcctgt tcagtctcat 
6661 cgtacttcgg aaatttcaat tcctgctgat cttgataaaa ctataacaga actagccgac 
6721 tggctggtat taatcgacca gatgctgaag tccaacattg tcactgttgg ggatgtagaa 
6781 gagatcaat-a agaccgtttc ccgaatgaaa attacaaagg ctgacttaga acagcgccat 
6841 cctcagctgg attatgtttt tacattggca cagaatttga aaaataaagc ttccagttca 
6901 gatatgagaa cagcaattac agaaaaattg gaaagggtca agaaccagtg ggatggcacc 
6961 cagcatggcg ttgagctaag acagcagcag cttgaggaca tgattattga cagtcttcag 
7021 tgggatgacc atagggagga gactgaagaa ctgatgagaa aatatgaggc tcgactctat 
7081 attcttcagc aagcccgacg ggatccactc accaaacaaa tttctgataa ccaaatactg 
7141 cttcaagaac tgggtcctgg agatggtatc gtcatggcgt tcgataacgt cctgcagaaa 
7201 ctcctggagg aatatgggag tgatgacaca aggaatgtga aagaaaccac agagtactta 
7261 aaaacatcat ggatcaatct caaacaaagt attgctgaca gacagaacgc cttggaggct 
7321 gagtggagga cggtgcaggc ctctcgcaga gatctggaaa acttcctgaa gtggatccaa 
7381 gaagcagaga ccacagtgaa tgtgcttgtg gatgcctctc atcgggagaa tgctcttcag 
7441 gatagtatct tggccaggga actcaaacag cagatgcagg acatccaggc agaaattgat 



TIGURb 3 (cont) 

7 501 gcccacaatg acatatttaa aagcattgac 
7561 ggaaattctg aagaggctac tatgcttcaa 
7 621 aatgacttaa aagcaaaatc tgctagcatc 
7681 tggaacaggt tgctgatgtc cttagaagaa 
7741 gagcttaaga aacaaatgcc tattggagga 
7801 cattgtaagg ccctgagacg ggagttaaag 
7861 gaccaggccc gagttttctt ggctgatcag 
7921 aacctacaat caaaaacaga attaactcct 
7981 atgcgcaaac agtcttctga agtcaaagaa 
8041 aattggcaaa agcaagtgga caaggcattg 
8101 gatgacctgg acgctgacat gaaggaggca 
8161 ggagacttac tcattgactc gctgcaggat 
8221 gaaattgcac caatcaactt taaagttaaa 
8281 ccacttgacc tgcatccctc tctaaagatg 
8341 tggaaacttt tacaggtttc tgtggatgat 
8401 gattttggac catcctctca gcattttctc 
8461 tccatttcac ataataaagt gccctattac 
8521 gaccatccta aaatgaccga actctttcaa 
8581 tctgcctacc gtacagcaat caaaatccga 
8641 ttagagttga gtacaacaaa tgaaattttc 
8701 ctcctcagtg ttccagatgt catcaactgt 
87 61 atgcataagg acctggtcaa cgttccactc 
8821 aatgtctatg acacgggtcg aactggaaaa 
8881 atgtctctct ccaaaggtct cttggaagaa 
8941 gggccgacag aaatgtgtga ccagaggcag 
9001 atcccccggc agctaggtga agtagcagct 
9061 cgcagctgct tccaacagaa taacaataaa 
9121 tggatgcatt tggaaccaca gtccatggtt 
9181 gcggagactg caaaacatca ggccaaatgc 
9241 ttcaggtata gaagccttaa gcattttaac 
9301 ggtcgaacag caaaaggtca caaattacat 
9361 acatctgggg aagatgtacg agacttcaca 
9421 aagtactttg ccaaacaccc tcgacttggt 
9481 gacaacttag agactcctat cacactcatc 
9541 caatctcctc aactgtttca tgatgacacc 
9601 ctggcccaga tggaaaggac taatgggtct 
9661 agtgtggaag acgagcacgc cctcatccag 
9721 ccagtgagcc agccgcagag cccagctcag 
9781 ggagaactgg agaggatcat tgctgacctg 
9841 tatgagcagc tgaaggacca gcacctccga 
9901 tcgattatat ctccccatca cacgtctgag 
9961 ctcaggcagc acaaaggtcg gctggaggct 
10021 cagctggagt ctcagctcca ccgcctccga 
10081 cgaatcaatg gtgtttcccc atgggcttct 
10141 gatccagatg cctccggccc acagttccac 
10201 ccgcacgaca ccagcacgga tctcacggag 
10261 tcttgctgcc caaatgttcc cagcaggcca 



ggaaacaggc agaagatggt aaaagctttg 
catcgactgg atgatatgaa ccaaagatgg 
agggcccatt tggaggccag cgctgargaag 
ctgatcaaat ggctgaatat gaaagatgaa 
gatgttccag ccttacagct ccagtatgac 
gagaaagaat attctgtcct gaatgctgtc 
ccaattgagg cccctgaaga gccaagaaga 
gaggagagag cccaaaagat tgccaaagcc 
aaatgggaaa gtctaaatgc tgtaactagc 
gagaaactca gagacctgca gggagctatg 
gagtccgtgc ggaatggctg gaagcccgtg 
cacattgaaa aaatcatggc atttagagaa 
acggtgaatg atttatccag tcagctgtct 
tctcgccagc tagatgacct taatatgcga 
cgccttaaac agcttcagga agcccacaga 
tctacgtcag tccagctgcc gtggcaaaga 
atcaaccatc aaacacagac cacctgttgg 
tcccttgctg. acctgaataa tgtacgtttt 
agactacaaa aagcactatg tttggatctc 
aaacagcaca agttgaacca aaatgaccag 
ctgacaacaa cttatgatgg acttgagcaa 
tgtgttgata tgtgtctcaa ttggttgctc 
attagagtgc agagtctgaa gattggatta 
aaatacagat atctctttaa ggaagttgcg 
ctgggcctgt tacttcatga tgccatccag 
tttggaggca gtaatattga gcctagtgtt 
ccagaaataa gtgtgaaaga gtttatagat 
tggctcccag ttttacatcg agtggcagca 
aacatctgta aagaatgtcc aattgtcggg 
tatgatgtct gccagagttg tttcttttcg 
tacccaatgg tggaatattg tatacctaca 
aaggtactta agaacaagtt caggtcgaag 
tacctgcctg tccagacagt tcttgaaggt 
agtatgtggc cagagcacta tgacccctca 
cattcaagaa tagaacaata tgccacacga 
tttctcactg atagcagctc caccacagga 
cagtattgcc aaacactcgg aggagagtcc 
atcctgaagt cagtagagag ggaagaacgt 
gaggaagaac aaagaaatct acaggtggag 
agggggctcc ctgtcggttc accgccagag 
gattcagaac ttatagcaga agcaaaactc 
aggatgcaga ttttagaaga tcacaataaa 
cagctgctgg agcagcctga atctgattcc 
cctcagcatt ctgcactgag ctactcgctt 
caggcagcgg gagaggacct gctggcccca 
gtcatggagc agattcacag cacgtttcca 
caggcaatgt ga 



FIGURE 4 {Mouse Utrophin cDNA, Acc. No. Y12229, SEQ ID N0:4) 

1 atggccaagt atggggacct tgaagccagg cctgatgatg ggcagaacga attcagtgac 
61 atcattaagt ccagatctga tgaacacaat gatgtacaga .agaaaacctt taccaaatgg 
121 'ataaacgctc gattttccaa gagtgggaaa ccacccatca gtgatatgtt ctcagacctc 
181 aaagatggga gaaagctctt ggatcttctc gaaggcctca caggaacatc attgccaaag 
241 gaacgtggtt ccacaagggt gcatgcctta aacaatgtca accgagtgct acaggtttta 
301 catcagaaca atgtggactt ggtgaatatt ggaggcacgg acattgtggc tggaaatccc 
361 aagctgactt tagggttact ctggagcatc attctgcact ggcaggtgaa ggatgtcatg 
421 aaagatatca tgtcagacct gcagcagaca aacagcgaga ag.atcctgct gagctgggtg 
481 cggcagacca ccaggcccta cagtcaagtc aacgtcctca acttcaccac cagctggacc 
541 gatggactcg cgttcaacgc cgtgctccac cggcacaaac cagatctctt cgactgggac 
601 gagatggtca aaatgtcccc aattgagaga cttgaccatg cttttgacaa ggcccacact 
661 tctttgggaa ttgaaaagct cctaagtcct gaaactgttg ctgtgcatct ccctgacaag 
721 aaatccataa ttatgtattt aacgtctctg tttgaggtgc ttcctcagca agtcacgata 
781 gatgccatcc gagaggtgga gactctccca aggaagtata agaaagaatg tgaagaggaa 
841 gaaattcata tccagagtgc agtgctggca gaggaaggcc agagtccccg agctgagacc 
901 cctagcaccg tcactgaagt ggacatggat ttggacagct accagatagc gctagaggaa 
961 gtgctgacgt ggctgctgtc cgcggaggac acgttccagg agcaacatga catttctgat 
1021 gatgtcgaag aagtcaaaga gcagtttgct acccatgaaa cttttatgat ggagctgaca 
1081 gcacaccaga gcagcgtggg gagcgtcctg caggctggca accagctgat gacacaaggg 
1141 actctgtcca gagaggagga gtttgagatc caggaacaga tgaccttgct gaatgcaagg 
1201 tgggaggcgc tccgggtgga gagcatggag aggcagtccc ggctgcacga cgctctgatg 
^ 1261 gagctgcaga agaaacagct gcagcagctc tcaagctggc tggccctcac agaagagcgc 

□ 13'21 attcagaaga tggagagcct cccgctgggt gatgacctgc cctccctgca gaagctgctt 
[V 1381 caagaacata aaagtttgca aaatgacctt gaagctgaac aggtgaaggt aaattcctta 
m 1441 actcacatgg tggtgattgt ggatgaaaac agtggggaga gtgccacagc tcttctggaa 
« 1501 gatcagttac agaaactggg tgagcgctgg acagctgtat gccgctggac tgaagaacgt 
J2 1561 tggaacaggt tgcaagaaat cagtattctg tggcaggaat tattggaaga gcagtgtctg 
w 1621 ttggaggctt ggctcaccga aaaggaagag gctttggata aagttcaaac cagcaacttt 
£ 1681 aaagaccaga aggaactaag tgtcagtgtc cggcgtctgg ctatattgaa ggaagacatg 
m 1741 gaaatgaaga ggcagactct ggatcaactg agtgagattg gccaggatgt gggccaatta 

1801 ctcagtaatc ccaaggcatc taagaagatg aacagtgact ctgaggagct aacacagaga 

■ 1861 tgggattctc tggttcagag actcgaagac tcttctaacc aggtgactca ggcggtagcg ' 

1921 aagctcggca tgtcccagat ' tccacagaag gacctattgg agaccgttca- tgtgagagaa 

□ 1981 caagggatgg tgaagaagcc caagcaggaa ctgcctcctc ctcccccacc aaagaagaga 
q 2041 cagattcacg tggacgtgga ggccaagaaa aagtttgatg ctataagtac agagctgctg 
S 2101 aactggattt tgaaatcaaa gactgccatt cagaacacag agatgaaaga atataagaag 

2161 tcgcaggaga cctcaggaat gaaaaagaaa ttgaagggat tagagaaaga acagaaggaa 

□ 2221 aatctgcccc 'gactggacga actgaatcaa accggacaaa ccctccggga gcaaatggga 
Q 2281 aaagaaggcc ttccactgaa agaagtaaac gatgttctgg aaagggtttc gttggagtgg 

2341 aagatgatat ctcagcagct agaagatctg ggaaggaaga tccagctgca ggaagatata 
2401 aatgcttatt ttaagcagct tgatgccatt gaggagacca tcaaggagaa ggaagagtgg 
24 61 ctgaggggca cacccatttc tgaatcgccc cggcagccct tgccaggctt aaaggattct 
2521 tgccagaggg aactgacaga tctccttggc cttcacccca gaattgagac gctgtgtgca 
2581 agctgttcag ccctgaagtc tcagccctgt gtcccaggtt ttgtccagca gggttttgac 
2641 gaccttcgac atcattacca ggctgttgcg aaggctttag aggaatacca acaacaacta 
2701 gaaaatgagc tgaagagcca gcctggaccc gagtatttgg acacactgaa taccctgaaa 
2761 aaaatgctaa gcgagtcaga aaaggcggcc caggcctctc tgaatgccct gaacgatccc 
2.821 atagcggtgg agcaggccct gcaggagaaa aaggcccttg atgaaaccct tgagaatcag 
2881 aaacatacgt tacataagct ttcagaagaa acgaagactt tggagaaaaa tatgcttcct 
2941 gatgtgggga aaatgtataa acaagaattt gatgatgtcc aaggcagatg gaataaagta 
3001 aagaccaagg tttccagaga cttacacttg ctcgaggaaa tcacccccag actccgagat 
3061 tttgaggctg attcagaagt cattgagaag tgggtgagtg gcatcaaaga cttcctcatg 
3121 aaagaacagg ctgcccaagg agacgctgct gcgcagagcc agcttgacca atgtgctacg 
3181 tttgctaatg aaatcgaaac catcgagtca tctctgaaga acatgaggga agtagagact 
3241 agccttcaga ggtgtccagt cactggagtc aagacatggg tacaggcaag actagtggat 
. 3301 taccaatccc aactggagaa attcagcaaa gagattgcta ttcaaaaaag caggctgtta 
3361 gatagtcaag aaaaagccct gaacttgaaa aaggatttgg ctgagatgca ggagtggatg 
3421 gcacaggctg aagaggacta cctggagagg gacttcgagt acaaatctcc agaagaactc 
3481 gagagtgcgg tggaggaaat gaagagggca aaagaggatg tgctgcagaa ggaggtgagg 
3541 gtgaaaattc tgaaggacag catcaagctg gtggctgcca aggtgccctc tggtggccag 
3601 gagttgacgt cggaattcaa cgaggtgctg gagagctacc agcttctgtg caatagaatt 



FIGURE 4 (cont.) 

3661 cgagggaagt gccacacact ggaggaggtc 
3721 ctggacctgg agaccacgtg gttgaacacc 
3781 ctgcctgaga gggcagaagc tgttcatgaa 
3841 catccagcgg ataatcgcac ccagattcgg 
3901 atcctggatg acataatcag cgagaagctg 
3961 agtcacttgg cggagagcaa acagatttct 
4021 actgaccaca tgcttcaggt gctgaaggag 
4081 acatacctga cggacaggat cgatgccttc 
4141 gccgaaatct cagcccatga gctcaccctg 
4201 cccccgacgt cccctgaggg cagggccacc 
4261 aggaaacttc gagaggtctc caccaaattc 
4321 cagcggatgc tggactgcaa gcgtgtgttg 
4381 gatgtgaggg atgtggaccc tgatgtcatt 
44 41 tataaaacgt tgagtgaagt caaacttgaa 
4501 attgtccaga agcagcagac' ggacaacccg 
4561 aaagtcctct acaatgacct gggcgcacag 
4 621 gcctcacagc tgtccaggaa gatgaagaag 
46B1 gccacagagg cagaactagt gcagaaatcc 
4741 acagaaatct cctgggctaa aagtattctc 
4801 aatggcatta cagagagcag tgctgccctt 
4861 ctggaagaga acctctgtgt gctcaatgct 
4 921 gactggtgca acaccttgct gaaccatcaa 
4 981 gctcacatca gtacctggct ctatcaagca 
5041 ccagcgagta aacaggaaga aattgtgaag 
5101 ctccaggttg agaatgttcg ggaacaagcc 
5161 agcagggaac tcgtggaacc aaaattagcc 
5221 cagcacataa agagcgcccg aatgctgatt 
5281 gaccctgctg gaactgttca agctgctgag 
5341 gacatagaaa acatgttgaa agttgtggaa 
5401 atggatgagg agcaagccca gattgaggaa 
54 61 gaacctatgg aggacagtaa gaaagaaaag 
5521 cgttacaaca aaattaagac aatccctatc 
5581 ggaattacat catcagccct ccctgcagat 
5641 actctggatg acattgaatt atcacttaat 
5701 gacttctctt tccaggaaga ctctctgaag 
5761 gagcagattg cagttgttca cgagaagcag 
5821 gaggccattc agatcaggga catgctcgct 
5881 agagtgtaca gtgatcggag agggtccttt 
5941 caccatgacc ttgatgacct tacacagtgg 
6001 acttgtgctc cagatggtag cctggacctg 
6061 gaagagggcc tcagcagcca ccagcccagc 
6121 cttgttcaga gactccgccc ctcggaggca 
6181 aaccagcgct ggagcactct tgtagctgag 
6241 gaaagtcagc aggtgttggg gtataagaga 
6301 aaagtggaga -gtgctgtgca gaagagatca 
6361 ttaacagatt tagcccaaga gacggaagtt 
6421 gcagaactgg aaatgctttc agacaaaaat 
6481 gaaagtttaa agaatgtaaa cacaacatgg 
6541 ctgaagacac gcacccaaga cccctgctct 
6601 aacgtccaaa aggtggtgct agtatcatct 
6661 gaaatctcgg ttcctgctga tttggataaa 
6721 ttgatcgacc aaatgctgaa gtccaacatt 
6781 aagacagttt cccggatgaa aatcacaaag 
6841 gattgtgtat ttacgttggc ccaaaatttg 
6901 acagcaatca cagaaaaatt ggaaaagctg 
6961 gtggagctgc ggcggcagca gctggaggac 
7021 cacagggaag agactgaaga gctcatgaga 
7081 caggcccgcc gggacccact tagcaaacaa 
7141 ctggggtctg gcgatggtgt catcatggcg 
7201 gaatacagtg gcgatgacac aaggaatgtg 
7261 tgggtcaatc tcaaacaaag catcgctgat 
7321 acagtgcaga cttctcgtag agacctggag 
7381 accacagcaa atgtgctggc cgatgcctct 
74 41 ctggcccggc agctccgaca gcagatgctg 



tggtcttgct gggtggagct, gctteactat 
ttggaggagc gcgtgaggag cacggaggcc 
gctctggagt ctcttgagtc tgttttgcgc 
gaacttgggc agactctgat tgatggtgga 
gaggctttta acagccgcta cgaagagctg 
ttggagaagc aactccaggt cctccgcgaa 
agcctggggg agctggacaa acagcttacc 
caactgccac aggaagctca gaagatccaa 
gaggagctga ggaagaatgt gcgctcccag 
agaggaggaa gtcagatgga catgctacag 
cagcttttcc agaagcccgc aaatttcgag 
gagggagtga aggccgagct tcatgtcctc 
caggcccact tggacaagtg catgaaacta 
gttgagactg tcatcaaaac agggaggcac 
aaaagcatgg acgaacagct tacatctctg 
gtgacagaag ggaagcaaga cctggaaaga 
gaggctgccg tcctctctga atggctctct 
acatcagaag" gcgtgattgg tgacctggac 
aaggatctgg aaaagaggaa agttgactta 
cagcacttgg tcttgggcag tgagtctgtt 
ggatggagcc gagtgcggac gtggaccgaa 
aaccagctgg agctatttga tggacacgtc 
gaagctctgt tggatgagat cgaaaagaaa 
cgtttactgt ctgaattgga tgatgccagc 
atcatcttgg tgaatgctcg tggaagcgcc 
gagctgagca ggaactttga aaaggtgtcc 
ggtcaggacc cttcatccta ccaaggcttg 
tctttctctg acttggaaaa cttagaacaa 
aagcacttgg accccaataa cgatgagaag 
gttctacaaa gaggggagca tttgttacat 
atccgcttgc agttgttact tttgcatact 
cagcagagaa aaacaattcc agtttcttct 
tatttggttg aaattaataa aattttactc 
atgccggagc taaacaccac tgtctacaaa 
agtatcaaag gtcaactgga cagacttgga 
ccggatgtca tcgtggaagc ctctggccct 
cagctgaacg caaaatggga ccgagtgaat 
gccagggctg tggaggaatg. gaggcagttc 
ctatctgaag ctgaagacct gctggtagac 
gagaaagcca gggcacagca gctggaactg 
ctgatcaagg ttaaccgaaa gggggaggac 
agcttcctga aggagaagct ggcaggtttc 
gtggaggctt tgcagcccag gctaaaagga 
cggctagatg aggtcacctg ctggttaacg 
acccctgacc cggaagaaag cccacaggaa 
caagctgaaa acattaagtg gctgaacaga 
ctgagtttgc gtgaaagaga gaaactttcg 
accaaggtat gcagagaagt gcctagcctc 
gccccacaga tgaggatggc tgctcatccc 
gcatcagatg ctcctctgcg tggcggcctg 
accatcacag aactggctga ctggctggta 
gtcactgtgg gggacgtgaa agagatcaat 
gctgatttag aacaacgcca tcctcagctt 
aaaaacaaag -cttccagttc agatgtgaga 
aagacccagt gggagagtac tcagcatggt 
atggttgtgg acagcctgca gtgggacgac 
aaatacgagg ctcgcttcta catgctgcag 
gtttctgata atcaactatt gcttcaagag 
tttgataatg tcctgcagaa acttctggaa 
gaagaaacca cggagtactt gaaaacatca 
agacagagtg ccttggaggc tgagctacag 
aactttgtca agtggcttca ggaagcagaa 
cagcgggaga atgctcttca ggacagtgtc 
gacatccagg cagaaattga tgcccacaat 



FIGURE 4 (cont.) 

7 501 gacatattta aaagcatcga tggaaaccgg 
7561 gaggaagcaa caatgcttca acatcgactg 
7621 aaggcaaaat ctgctagcat cagggcccat 
7 681 ttgctggcat cgctggaaga gctgatcaaa 
7741 aagcagatgc ccattggagg ggacgtccct 
7801 gtgctgagac gtgagctaaa ggagaaagag 
7861 cgagtttttc tggctgatca gccaatagag 
7 921 tcaaagacag agttgactcc tgaggagaga 
7 981 cagtcttctg aagtccgaga gaagtgggaa 
8041 aagcaagtag ggaaggcgtt agagaaactc 
8101 gacgcagaca tgaaggaggt ggaggctgtg 
8161 cttatagact ccctgcagga tcacatcgag 
8221 ccaatcaact taaaagtaaa aacaatgaat 
8281 ttgcatccat ctctaaagat gtctcgccag 
8341 ctacaggttt ccgtggacga tcgccttaag 
8401 ccatcttctc aacactttct gtccacttca 
84 61 cataataaag tgccctatta catcaaccat 
8521 aaaatgactg agctcttcca atcccttgct 
B581 cgcacagcaa tcaaaattcg aaggctgcaa 
8641 aatacgacga atgaagtttt caagcagcac 
8701 gtcccagacg tcatcaactg tctgaccacc 
8761 gacttggtca atgttccact ctgcgtcgat 
8821 gacacgggcc ggactggaaa aattcgggta 
8881 tccaaaggcc tcttagaaga gaaatacaga 
8941 gagatgtgtg accagcggca gcttggcctg 
9001 cagctggggg aagtagcagc ctttgggggc 
9061 ttccagcaga ataacaacaa gccagaaatc 
9121 ttggaacccc agtccatggt gtggttgccg 
9181 gcaaaacatc aggcqaaatg caacatctgc 
9241 aggagcctaa agcattttaa ttatgatgtc 
9301 gcaaagggcc acaagttaca ttacccgatg 
9361 gaagatgtga gagatttcac taaggtgctg 
9421 gccaaacatc ctcggcttgg ctacctgcct 
9481 gaaactccta tcacgctcat cagtatgtgg 
9541 cagctgtttc atgatgacac ccactcaaga 
9601 atggaaagga caaacgggtc cttcctaact 
9661 gatgagcatg ccctcatcca gcagtactgc 
9721 cagccgcaga gtccagctca gatcctgaag 
9781 gagcggatca ttgctgactt ggaggaagag 
9841 ctgaaggagc agcacctaag aaggggtctc 
9901 tctcctcacc acacatctga ggactcagaa 
9961 cacaaagggc ggctggaggc gaggatgcaa 
10021 tctcagctgc accgcctcag acagctcctg 
10081 ggtgtctccc cctgggcttc cccacagcat 
10141 ccaggcccac agttccacca ggcagcatct 
10201 agcacggacc tcacggacgt gatggagcag 
10261 aatgtcccca gcaggccaca ggcaatgtga 
10321 ccttcagtat tgccctcttc tgcaaatgcc 
10381 cgtggctcca cgacacaagc tgttgagtgc 
10441 actgactatc caaagatatt ttggttttct 
10501 ctttctatgc aactgtaaat taatgaacag 
10561 cactgatttg . tataatatat acagccatgg 
10621 ctgtcttttt aataaccaag agaaaaaatt 
10681 cattccttct gctgttcaca ttaaccttgt 
10741 taccattatg ttttggttat ttataaattt 
10801 tatttggttt ctataatctg gccaaattcc 
10861 catatgtgga gtaggcaggc attctgaaga 
10921 accatccttt agtaaccact ggatgactac 
10981 ctgctggggt atttttcctc ctgtttttat 
11041 tatttatttc atcttgcaat ctctataata 



cagaagatgg tgaaagctct ggggaattct 
gatgacatga accaaagatg gaatga_tttg 
ttggaggcca gtgctgagaa atggaaccgg 
tggctcaata tgaaagatga ggagcttaag 
gccttacagc tccagtatga ccactgcaag 
tattctgtgc tgaacgccgt agatcaagct 
gcccccgaag aaccaagaag aaacccacaa 
gcccagaaga tcgccaaagc catgcgcaag 
aatctaaatg ctgtcactag caactggcaa 
cgagacctgc agggagctat ggacgacctg 
cggaatggct ggaagcccgt gggagacctg 
aaaaccctgg cgtttagaga agaaattgca 
gacctgtcca gtcagctgtc tccacttgac 
ctggatgacc ttaatatgcg atggaaactt 
cagctccagg aagcccacag agattttggg 
gtccagctgc cgtggcagag atccatttca 
caaacacaga caacctgttg ggatcatcct 
gatctgaata atgtacgttt ctctgcctac 
aaagcattat gtctggatct cttagagctg 
aaactgaacc aaaatgatca gctcctgagt 
acttacgatg ggcttgagca gctgcacaag 
atgtgtctca actggctgct caacgtatac 
cagagtctga agattggatt gatgtctctc 
tgtctcttta* aggaggtggc agggccaact 
ctacttcacg atgccatcca gatccctagg 
agtaacattg agcccagtgt ccgcagctgc 
agtgtgaagg agtttataga ctggatgcat 
gttctgcatc gggtcgcagc tgctgagact 
aaagaatgcc cgattgttgg gttcagatac 
tgccagagtt gcttcttttc tggaagaaca 
gtagaatact gcataccgac aacatctggg 
aagaacaagt tcaggtccaa .gaaatatttt 
gtccagaccg tgctggaagg ggacaactta 
ccagagcact atgacccctc ccagtcccct 
atagagcaat acgctacacg actggcccag 
gatagcagct ctacaacagg aagcgtggag 
cagaccctgg gcggggagtc acctgtgagt 
tccgtggaga gggaagagcg tggggaactg 
caaagaaatc tgcaggtgga gtatgagcag 
cctgtgggct cccctccaga ctccatcgta 
cttatagcag aagctaaact cctgcggcag 
attttggaag atcacaataa acagctggag 
gagcagcctg actctgactc ccgcatcaat 
tctgcattga gctactcact tgacactgac 
gaggacctgc tggccccacc tcacgacact 
atcaacagca cgtttccctc ttgcagctca 
gcatctatcc agccagccaa catttcccga 
aatcccaaga cccattcaac cccaaagctc 
ttactgggtg ttctactgag ggaaccaaac 
aataacgtat attattgttt tctttctccc 
agaagtattt ggaggtggta aagcatttgt 
gaaagtgggt gggggctttc taatatgaaa 
gcataagaat tagaccactt tacattatta 
acaataactt .cacttattat ttgactgttt 
atcagccata ccaaacgaat agattctatg 
taagttcata tatttgaatc aaatatttta 
tactatttaa ctttagttga cgtcacacac 
actaaaaatc ctgtggactt taacggcaag 
tccttttttg taagtagatc ttgacgtctt 
aagaagactg tattgtaata gtcccc 
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SE ? ^a?;L^r=fc?U^^U, f c t cagatctggg aggcaattac cttcggagaa 

& tcaffi lll?cT& "tt?tacca ggtttttttt 

181 atcgctgcct tgatatacac ttttcaaa 

SEQ ID NO: 6 (N terminus, 209-964) 

i SSffSsa =| 113 

361 laacctcctc gaaggcctga cagggcaaaa actgccaaaa gaaaaaggat ccacaagagt 
«1 watgccctg laclatgtca acaaggcact gcgggttttg cagaacaata atgttgattt 
481 agtgiatatt ggaagtactg acatcgtaga tggaaatcat aaactgactc. "99"tgat 
5U Ugfaatata Itcctccact ggcaggtcaa aaatgtaatg ^aaatatca tggctggatt 
601 acaacaaacc aacagtgaaa agattctcct gagctgggtc cgacaatcaa ctcgtaatta 
66 tccacaggtt aatgtaltca acttcaccac cagctggtct gatggcctgg ""gaatgc 
i?i £ri-r.atccat aotcataqqc cagacctatt tgactggaat agtgtggttt gccagcagtc 
ill aaccacacaa cgactggllc at^cattcaa catcgccaga tatcaattag g«tagagaa 
II] a?tactcaat cctoaaqatg ttgataccac ctatccagat aagaagtcca tcttaatgta 
loi catcaca?ca ctc??ccaal ttltgcctca acaagtgagc attgaagcca tccaggaagt 
961 ggaa 

SEQ ID NO;7 (Hinge 1, 965-1219) „n.n.„ a „* taratcatca 

965 atgttg ccaaggccac ctaaagtgac taaagaagaa ca ""«ft tacatcatca 
1021 aatgcactat tctcaacaga tcacggtcag tctagcacag 9«*tatgaga 9»«ttcttc 
1081 ccctaaqcct cgattcaaga gctatgccta cacacaggct gcttatgtca ccacctctga 
1141 ccctlclcgg agcccatttc cttcacagca tttggaagct cctgaagaca agtcatttgg 
1201 cagttcattg atggagagt 

SEQ ID tJO:8 (Repeat 1, 1220-1546) 

1220 g aagtaaacct ggaccgttat caaacagctt tagaagaagt ..,...,.„ 

1261 Ittatcgtgg cttctttctg ctgaggacac attgcaagca caaggagaga "tctaatga 

1321 tgtggaagtg gtgaaagacc agtttcatac tcatgagggg tacatgatgg a"tgacagc 

1381 ccatcagggc cgggttggta atattctaca attgggaagt aagctgattg ^acaggaaa 

1441 attatcagaa gltgaagaaa ctgaagtaca agagcagatg aatctcctaa attcaagatg 
1501 ggaatgcctc agggtagcta gcatggaaaa acaaagcaat ttacat 

SEQ ID NO:9 (Repeat 2, 1547-1879) 

^tccagaafclgaaactga aagagttgaa tgactggcta acaaaaacag aagaaagaac 
1621 aaggaalatg gaggaagagc ctcttggacc tgatcttgaa 9'ccfcaaaac |«aagtaca 
1681 acaacataag gtgcttcaag aagatcta'ga acaagaacaa gtcagggtca attctctcac 
1741 tcacatggtg gtggtagttg atgaatctag tggagatcac gcaactgctg ctttggaaga 
1801 auacttaag gtlttgggag atcgatgggc aaacatctgt agatggacag aagaccgctg 
1861 ggttctttta caagacatc 

SEQ ID NO:X0 (Repeat 3, 1880-2212) • 
1880 c ttctcaaatg gcaacgtctt actgaagaac agtgcctttt . 
1921 tagtgcatgg ctttcagaaa aagaagatgc agtgaacaag attcacacaa etggcttta. 
1981 agltcaaaat gaaatgttat caagtcttca aaaactggcc gttttaaaag cggatctaga 
2041 aaagaaaaag caatccatgg gcaaactgta ttcactcaaa caagatcttc "tcaacact 
2101 qaagaataag tcagtgaccc agaagacgga agcatggctg gataactttg cccggtgttg 
2161 ggataattta gtccaaaaac ttgaaaagag tacagcacag atttcacagg ct 

SEQ ID NO:ll (Hinge 2, 2213-2359) 

2221 cactcagcca tcactaacac agacaactgt aatggaaaca gtaactacgg tgaccacaag 
2281 ggaacalatc ctggtaaagc atgctcaaga ggaacttcca ccaccacctc cccaaaagaa 
2341 gaggcagatt actgtggat 
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SEQ ID N0:12 (Repeat 4, 2360-2692) 
2360 t ctgaaattag gaaaaggttg gatgttgata taactgaact rastctttca 
2401 tcacagctgg Ittactcgct cagaagctgt gttgcagagt cctgaatttg "^ctttcg 
2461 gaaggaaggc aacttctcag acttaaaaga aaaagtcaat gccatagagc gag aaa «gc 
2521 tgaglagttc agaaaactgc aagatgccag cagatcagct "ggccctgg tggaacagat 
2581 ggtgaatgag ggtgttaatg cagatagcat caaacaagcc tcagaacaac tgaacagccg 
2641 gtggatcgai ttctgccagt tgctaagtga gagacttaac tggctggagt at 

SEQ ID NO: 13 (Repeat 5, 2693-3019) 

270? catcttcgct ttctataatc agctacaaca attggagcag atgacaacta ctgctgaaaa 

2761 ctggttgaaa atccaaccca ccaccccatc agagccaaca gc^ttaaaa 

2821 aatStgtaag gatgaagtca accggotatc aggtcttcaa cctcaaattg "^"aaa 

?flA1 aat-.tr.aaaac ataaccctqa aagagaaagg acaaggaccc atgttcctgg atgcagactt 

IS" ?gtggcc?tt acalatcatt ttaagcaagt cttttctgat gtgcaggcca gagagaaaga 

3001 gc'tacagaca atttttgac 

SEQ ID NO: 14" (Repeat 6, 3020-3346) 

3020 a ctttgccacc aatgcgctat caggagacca tgagtgccat n+rrtrarrn* 

^nfil caoaacataa atccaqcagt cagaaaccaa actctccata cctcaactta gtgtcaccga 

3121 cta1gaaa?c 'atggagcaga gactcgggga attgcaggct ttacaaagtt ctctgcaaga 

3181 gcaacaaagt ggcctatact atctcagcac cactgtgaaa gagatgtcga agaaagcgcc 

3241 ctctgaaatt agccggaaat atcaatcaga atttgaagaa attgagggac gctggaagaa 
3301 gctctcctcc cagctggttg agcattgtca aaagctagag gagcaa 

SEQ ID NO: 15 (Repeat 1, 3347-3673) 
3347 atga ataaactccg 

3361 aaaaattcag aatcacatac aaaccctgaa gaaatggatg gctgaagttg Jtgtttttct 
3421 gaaggaggal tggcctgccc ttggggattc agaaattcta aaaaagcagc tgaaacagtg 
3481 cagactttta gtcagtgata ttcagacaat tcagcccagt ctaaacagtg tcaatgaagg 
3541 tgggcagaag ataalgaatg aagcagagcc agagtttgct tcgagacttg agacagaact 
3601 cawgHctt aacactcagt gggatcacat gtgccaacag gtctatgcca gaaaggaggc 
3661 cttgaaggga ggt 

SEQ ID NO:16 (Repeat 8, 3674-4000) 

3674 ttqqaga aaactgtaag cctccagaaa gatctatcag agatgcacga ^^ a/Ta 

3721 atggatgaca caagct^aag aagagtatct tgagagagat tttgaatata aaactccaga 

3781 tgaattacag aaagcagttg aagagatgaa gagagctaaa gaagaggccc aacaaaaaga 

3841 agcgaaagtg aaactcctta ctgagtctgt aaatagtgtc atagctcaag ctccacctgt 

3901 agcacaagag gccttaaaaa aggaacttga aactctaacc accaactacc agtggctctg 

3961 cactaggctg aatgggaaat gcaagacttt ggaagaagtt 

SEQ ID NO: 17 (Repeat 9, 4001-4312) 

Scafac g£3SU caaacaagtg gctaaatgaa gtagaattta aacttaaaac 
4081 cactgaaaac attcctggcg gagctgagga aatctctgag gtgctagatt "Cttgaaaa 
4141 tttgatgcga cattcagagg ataacccaaa tcagattcgc atattggcac ag«cctaac 
4201 agatggcgga gtcatggatg agctaatcaa tgaggaactt gagacattta attctcgttg 
4261 gagggaacta catgaagagg ctgtaaggag gcaaaagttg cttgaacaga gc 

SEQ ID NO:18 (Repeat 10, 4313-4588) 

4 321 tgcccaggag actgaaaaat ccttacactt aatccaggag tccctcacat tcattgacaa 

4381 gcagttggca gcttatattg cagacaaggt ggacgcagct caaatgcctc ^^agccca 

4441 gaaaatccaa tctgatttga caagtcatga gatcagttta gaagaaatga agaaacataa 

4501 tcaggggaag gaggctgccc aaagagtcct gtctcagatt gatgttgcac agaaaaaatt 
4 561 acaagatgtc tccatgaagt ttcgatta 
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SEQ id N0:19 (Repeat 11, 4589-4915) 
4589 tt ccagaaacca gccaattttg agctgcgtct ».f«^fnr a i- fnaaaacaaa 

4621 acaagaaagt aagatgattt tagatgaagt gaagatgcac ttgcctgcat tggaaacaaa 

;s!i asas sssei sas ssss ssssss »H 
Hi; dsss S55S2 sees ssssss ssass ss?- 

SEQ ID NO: 20 (Repeat 12, 4916-5239) 

4921 atfgtcccgt aagatgcgaa aggaaatgaa tgtcttgaca ffaatggctgg "gctacaga 

4981 tatggaattg acaaagagat cagcagttga aggaatgcct agtaatttgg ?"ctgaagt 

5041 tgcctgggga aaggctactc aaaaagagat tgagaaacag aaggtgcacc tgaagagtat 

5101 clcaqaggta ggagaggcct tgaaaacagt tttgggcaag aaggagacgt tggtggaaga 

5161 taaactclg? Ittctglata g?aactggat agctgtcacc tcccgagcag aagagtggtt 

5221 aaatcttttg ttggaatac 

SEQ ID NO: 21 (Repeat 13, 5240-5551) 
5240 c agaaacacat ggaaactttt gaccagaatg tggaccacat a( , aaaC ccca 

5281 cacaaagtgg atcattcagg ctgacacact tttggatgaa tcagagaaaa agaaacccca 

5341 gcaaaaagaa gacgtgotta agcgtttaaa ggcagaactg aatgacatac 9«caaaggt 

5401 ggactctaca cgtgaccaag cagcaaactt gatggcaaac cgcggtgacc actgcaggaa 

54 61 Ittagtagag ccccaaatct cagagctcaa ccatcgattt gcagccattt cacacagaat 

5521 taagactgga aaggcctcca ttcctttgaa g 

SEQ ID NO: 22 (Repeat 1*, 5552-5833) 
SSS? naattaaaa caatttaact cagatataca 

558? IwiSgrtt glaccactgg a^gctgaaat tcagcagggg gtgaatctga aagaggaaga 
5641 cttcaataaa gatatgaatg aagacaatga gggtactgta aaagaattgt tgcaaagagg 
5701 agacaactta caacaaagaa tcacagatga gagaaagaga gaggaaataa ^ataaaaca 
57 61 gcagctgtta cagacaaaac ataatgctct caaggatttg aggtctcaaa gaagaaaaaa 
5821 ggctctagaa att 

SEQ ID NO:23 (Repeat 1.5, 5834-6127) 

5834 tctcatc agtggtatca gtacaagagg caggctgatg atctcctgaa .„ aaam , aa 

5881 atgcttggat gacattgaaa aaaaattagc cagcctacct gagcccagag atgaaaggaa 

5941 aataaaggaa attgatcggg aattgcagaa gaagaaagag gagctgaatg cagtgcgtag 

6001 gcaagctgag ggcttgtctg aggatggggc cgcaatggca 9tggagccaa ctcagatcca 

6061 gctcagcaag cgctggcggg aaattgagag caaatttgct cagtttcgaa gactcaactt 
6121 tgcacaa 

SEQ ID NO: 24 (Repeat 16, 6188-6514 ) 

6128 tct tatgtgcctt ctacttattt gactgaaatc actcatgtct cacaagccct 
6241 attagaagtg gaacaacttc tcaatgctcc tgacctctgt gctaaggact ttgaagatct 
6301 ctttaagcaa gaggagtctc tgaagaatat aaaagatagt ctacaacaaa gctcaggtcg 
6361 gattgacatt attcatagca agaagacagc agcattgcaa agtgcaacgc ctgtggaaag 
6421 ggtgaagcta caggaagctc tctcccagct tgatttccaa tgggaaaaag ttaacaaaat 
6481 gtacaaggac cgacaagggc gatttgacag atct 

SEQ ID NO; 25 (Repeat 17, 6515-6835)' 
6515 gttgag aaatqgcggc gttttcatta 

6541 tgatataaag atatttaatc agtggctaac agaagctgaa cagtttctca gaaagacaca 
6601 aattcctgag aattgggaac atgctaaata caaatggtat cttaaggaac tccaggatgg 
6661 cattgggcag cggcaaactg ttgtcagaac attgaatgca actggggaag aaataattca 
6721 gcaatcctca aaaacagatg ccagtattct acaggaaaaa ttgggaagcc tgaatctgcg 
6781 gtggcaggag gtctgcaaac agctgtcaga cagaaaaaag aggctagaag aacaa 
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SEQ ID N0:26 (Repeat 18, 6836-7186) 

lw ?Atc?tatca aaatttcaaa gagatttaaa tgaatttgtt ttatggttgg aggaagcaga 

6901 taacatSc? Igfatccclc ft^aacctgg alaagagcag caactaaaag "aagcttga 

fiQfii acaaatcaaa ttactqqtg'g aagagttgcc cctgcgccag ggaattctca aacaattaaa 

702 ?gatactggl ggac^gtgc ttgtaaglgc tcccataagc ccagaagagc aagataaact 

7081 tgaaaatllg ctcaagcaga caaatctcca gtggataaag gtttccagag ctttacctga 

7141 gaaacaagga gaaattgaag ctcaaataaa agaccttggg cagctt 

SEQ ID NO:27 (Repeat 19, 7187-7489} 

? 7^oI S^tUS C 53Sagtta. atcatctgct gctgtggtta tctcctatta ggaatcagtt 
7261 ggaaatttat aaccaaccaa accaagaagg accatttgac gttcaggaaa ^tgaaatagc 
7321 agttcaagct aaacaaccgg atgtggaaga gattttgtct aaagggcagc atttgtacaa 
7381 ggaaaaacca gccactcagc cagtgaagag gaagttagaa gatctgagct ctgagtggaa 
74 41 ggcggtaaac. cgtttacttc aagagctgag ggcaaagcag cctgaccta 

SEQ ID 110:28 (Hinge 3, 7490-7612) 

750°1 SacScS?S C 5gagcctctc ctactcagac tgttactctg ? tgacacaac ctgtggttac 
7561 taaggaaact gccatctcca aactagaaat gccatcttcc ttgatgttgg ag 

SEQ ID NO:29 (Repeat 20, 7613-7942) 

762? tc?ggclgat ttcaaccggg cttggacaga acttaccgac tggctttctc tgcttgatca 

7681 agttStaaaa tcacagaggg tgatggtggg tgaccttgag Satatcaacg ^tgatcat 

7741 clagcagaag gcaacaatgc aggatttgga acagaggcgt ccccagttgg jagaactcat 

7801 taccgctgcc caaaatttga aaaacaagac cagcaatcaa 9aggctagaa caatcattac 

7861 ggatcgaatt gaaagaattc agaatcagtg ggatgaagta caagaacacc ttcagaaccg 
7921 gaggcaacag ttgaatgaaa tg 

SEQ ID NO: 30 (Repeat 21, 7943-8269) 

7943 ttaaagga ttcaacacaa tggctggaag ctaaggaaga am «Tt:cccta 

7981 agctgagcag gtcttaggac aggccagagc caagcttgag tcatggaagg ^"ccta 

804L tacagtagat gcaatccaaa* agaaaatcac agaaaccaag cagttggcca fagacctccg 

8101 ccagtggcag acaaatgtag atgtggcaaa tgacttggcc ctgaaacttc tccgggatta 

8161 ttctgcagat gataccagaa aagtccacat gataacagag aatatcaatg cctcttggag 

8221 aagcattcat aaaagggtga gtgagcgaga ggctgctttg gaagaaact 

SEQ ID NO: 31 (Repeat 22, 8270-8617) 

9281 gcaaclgttc C cccctggacc tggaaaagtt tcttgcctgg cttacagaag ctgaaacaac 
8341 tgccaatgtc ctacaggatg ctacccgtaa ggaaaggctc ctagaagact ccaagggagt 
8401 aaaagagctg atgaaacaat ggcaagacct ccaaggtgaa attgaagctc acacagatgt 
8461 ttatcacaac ctggatgaaa acagccaaaa aatcctgaga tccctggaag gttccgatga 
8521 tgcagtcctg ttacaaagac gtttggataa catgaacttc aagtggagtg aacttcggaa 
8581 aaagtctctc aacattaggt cccatttgga agccagt 

SEQ ID NO:32 (Repeat 23, 8618-9004) 

8618 tct gaccagtgga agcgtctgca «- = = ^™rr,-* 

8641 ccttt'ctctg caggaacttc tggtgtggct acagctgaaa gatgatgaat taagccggca 

8701 ggcacctatt ggaggcgact ttccagcagt tcagaagcag aacgatgtac atagggcctt 

8761 caagagggaa ttgaaaacta aagaacctgt aatcatgagt actcttgaga ctgtacgaat 

8821 atttctgaca gagcagcctt tggaaggact agagaaactc taccaggagc ccagagagct 

8881 gcctcctgag gagagagccc agaatgtcac tcggcttcta cgaaagcagg ctgaggaggt 

8941 caatactgag tgggaaaaat tgaacctgca ctccgctgac tggcagagaa aaatagatga 

9001 gacc 
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SEQ ID NO:33 (Repeat 24, 9005-9328) „ a , nn .. c tcaaactqcq 

9005 cttgaa agactccagg aacttcaaga ggccacggat WCtggacc tcaagctgcg . 
9061 ccalgctgag gtgatcaagg gatcctggca gcccgtgggc ^ctcctca "gactctct 
9121 ccaagatcac ctcgagaaag tcaaggcact tcgaggagaa Jttgcgcctc ^J"^ 
9181 cgtglgccac gtcaatgacc ttgctcgcca gcttaccact ttgggcattc ^tctcacc 
9241 gtataacctc agcactctgg aagacctgaa caccagatgg aagcttctgc aggtggccgt 
9301 cgaggaccga gtcaggcagc tgcatgaa 

SEQ ID NO:34 (Hinge 4, 9329-9544) 

SSSS ESSE S2SSS 2SS2 5S2» S5S3SS 

9541 actc 

Qirn to no-35 (Start of C terminus, 9545-10431) 

9545 SaSa ctgcaS.99 ccctttgctt ggatctcttg agcctgtcag =tgcatgtga 
9601 tgctttggac clgclcaacc tcaagcaaaa tgacoagccc atggatatcc t^agattat 
9661 tlattgtttg accactattt atgaccgcct ggagcaagag "caacaatt tggtcaacgt 
9721 ccctctctgc gtggatatgt gtctgaactg gctgctgaat gtttatgata cgggacgaac 
9781 agggaggatc cgtgtcctgt cttttaaaac tggcatcatt tccctgtgta "gcacattt 
98*1 ggllglcaag tlcigatacc ttttcaagca agtggcaagt tcaacaggat tttgtgacca 
0 9901 gcgcaggctg ggcctccttc tgoatgattc tatccaaatt ccaagacagt tgggtgaagt 

0 9961 tgcatccttt gggggcagta acattgagcc aagtgtccgg agctgcttcc aatttgetaa 
ffi- 10021 taataagcca glgatcgaag cggccctctt cctagactgg atgagactgg "ccccagtc 

1 | 1008I catqqtgtgg ctgcccgtcc tgcacagagt ggctgctgca gaaactgcca agcatcaggc 
W 10141 caaitotala atitgcaaag agtgtccaat cattggattc aggtacagga gtctaaagca 
£0 S I ""afttat gaca'tctgcl alagctgctt tttttctggt gagttgca. «ggccat,a 
03. 10261 aatgcactat cccatggtgg aatattgcac tccgactaca t"ggagaag. atgttcgaga 
JE 10321 ctttgccaag gtactaaaaa acaaatttcg aaccaaaagg tattttgcga agcatccccg 
jt 10381 aatgggctac ctgccagtgc agactgtctt agagggggac aacatggaaa c 

« ' SEQ ID NO;36 (alternatively spliced exons 71-78, 10432-11254) 

S lo"? tctgltctac ttctggccag .tagattctgc gcctgcctcg tcccctcagc "tcacacga 

2 10501 tgatactcat tcacgcattg aacattatgc tagcaggcta gcagaaatgg aaaacagcaa 

3 10561 taaatcttat ctaaatgata gcatctctcc taatgagagc atagatgatg aacatttgtt 
01 }o6 tllcotlcti tactgccaaa ftttgaacca ggactccccc ctg.gcc.gc ctcgtagtcc 
Q 10681 tgcccagatc ttgatttcct tagagagtga. ggaaagaggg «agct.g.ga flaatcetagc 
« 10741 agatcttgag gaagaaaaca ggaatctgca agcagaatat gaccgtctaa agcagcagca 
" 10801 cgaacataaa ggcctgtccc cactgccgtc ccctcctgaa atgatgccca cctctcccca 

10861 gagtccccgg gatgctgagc tcattgctga ggccaagcta ctgcgtcaac acaaaggccg 
10921 cctqqaagcc aggatgcaaa tcctggaaga ccacaataaa cagctggagt cacagttaca 
lllll cagJcSagg dgctgctgg agcalcccca ggcagaggcc aaagtgaatg ^caacggt 
11041 gtcctctcct tctacctctc tacagaggtc cgacagcagt cagcctatgc tgctccgagt 
11101 ggttggcagt caaacttcgg actccatggg tgaggaagat cttctcagtc ctccccagga 
11161 cacaagcaca gggttagagg aggtgatgga gcaactcaac aactcettcc ctagttcaag 
11221 aggaagaaat acccctggaa agccaatgag agag 

SEQ ID NO: 37 (End of coding region, 11255-11266) 
11255 gacaca atgtag 
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SEQ ID NO:38 (3' UTR, 11267-13957) 

U28 7 l ^ggcagatg'a'tgggcag agcgatggag tccttagtat cagtcatgac agatgaagaa 
11341 ggagcagaat aaatgtttta caactcctga ttcccgcatg gtttttataa tattcataca 
11401 Icaaagagga ttagacagta agagtttaca agaaataaat ctatattttt ggaagggta 
11461 gtggtattat actgtagatt tcagtagttt ctaagtctgt tattgttttg "aacaatgg 
11521 caggttttac acgtctatgc aattgtacaa aaaagttata agaaaactac atgtaaaatc 
11581 ttgitagcta aataacttgc catttcttta tatggaacgc a^ttgggtt gtttaaaaat 
11641. ttStaaSagt tataaagaaa gattgtaaac taaagtgtgc tttataaaaa aaagttgttt 
11701 ataaaaaccc ctaaaaacaa aacaaacaca cacacacaca cat.acacaca ^cacacaaa 
11761 actttgaggc agcgcattgt tttgcatcct tttggcgtga tatccatatg aa ^tcatgg 
11821 ctttttcttt ttttgcatat taaagataag acttcctcta ccaccacacc aaatgactac 
11881 tacacactgc tcatttgaga actgtcagct gagtggggca ggcttgagtt ttcatttcat 
11941 atatctatat gtctataagt atataaatac tatagttata tagataaaga gatacgaatt 
12001 tctatagact gactttttcc attttttaaa tgttcatgtc acatcctaat agaaagaaat 
12061 tacttctagt cagtcatcca ggcttacctg cttggtctag aatggatttt tcccggagcc 
12121 ggaagccagg aggaaactac accacactaa aacattgtct acagctccag atgtttctca 
12181 ttttaaacaa ctttccactg acaacgaaag taaagtaaag tattggattt ^tttaaaggg 
12241 aacatgtgaa tgaatacaca ggacttatta tatcagagtg agtaatcggt tggttggttg 
12301 attgattgaf tgattgatac attcagcttc ctgctgctag caatgccacg atttagattt 
12361 aatgatgctt cagtggaaat caatcagaag gtattctgac cttgtgaaca tcagaaggta 
m 12421 ttttttaact cccaagcagt agcaggacga tgatagggct ggagggctat gg a .ttcccag 

W 12481 cccatccctg tgaaggagta ggccactctt taagtgaagg attggatgat tgttcataat 

B 12541 acataaagtt ctctgtaatt acaactaaat tattatgccc tcttctcaca gtcaaaagga 

fy 12601 actgggtggt ttggtttttg ttgctttttt agatttattg tcccatgtgg gatgagtttt 

" 12661 taaatgccac aagacataat ttaaaataaa taaactttgg gaaaaggtgt aagacagtag 

w 12721 ccccatcaca tttgtgatac tgacaggtat caacccagaa gcccatgaac tgtgtttcca 

03'' 12781 tcctttgcat ttctctgcga gtagttccac acaggtttgt aagtaagtaa gaaagaaggc 

6J 12841 aaattgattc aaatgttaca aaaaaaccct tcttggtgga ttagacaggt taaatatata 

r 12901 aacaaacaaa caaaaattgc tcaaaaaaga ggagaaaagc tcaagaggaa aagctaagga 

£ 12961 ctggtaggaa aaagctttac tctttcatgc cattttattt ctttttgatt tttaaatcat 

539 13021 tcattcaata gataccaccg tgtgacctat aattttgcaa atctgttacc tctgacatca 

■ 13081 agtgtaatta gcttttggag agtgggctga catcaagtgt aattagcttt tggagagtgg 

U 13141 gttttgtcca ttattaataa ttaattaatt aacatcaaac acggcttctc atgctatttc 

n ■ 13201 tacctcactt tggttttggg gtgttcctga taattgtgca cacctgagtt cacagcttca 
S 13261 ccacttgtcc attgcgttat tttctttttc ctttataatt ctttcttttt ccttcataat 

u 13321 tttcaaaaga aaacccaaag ctctaaggta acaaattacc aaattacatg aagatttggt 

(H 13381 ttttgtcttg catttttttc ctttatgtga cgctggacct tttctttacc caaggatttt 

Q 13441 taaaactcag atttaaaaca aggggttact ttacatccta ctaagaagtt taagtaagta 

F-i 13501 agtttcattc taaaatcaga ggtaaataga gtgcataaat aattttgttt taatcttttt 

■ 13561 gtttttcttt tagacacatt agctctggag tgagtctgtc ataatatttg aacaaaaatt 

13621 gagagcttta ttgctgcatt ttaagcataa ttaatttgga cattatttcg tgttgtgttc 
13681 tttataacca ccgagtatta aactgtaaat cataatgtaa ctgaagcata aacatcacat 
13741 ggcatgtttt gtcattgttt tcaggtactg agttcttact tgagtatcat aatatattgt 
13801 gttttaacac caacactgta acatttacga attatttttt taaacttcag ttttactgca 
13861 ttttcacaac atatcagact tcaccaaata tatgccttac tattgtatta tagtactgct 
13921 ttactgtgta tctcaataaa gcacgcagtt atgttac 



- 7 



FIGURE 11 



Query=Human Dystrophin 1220-9328; Sbjt=Mouse Dystrophin 1238-9319 
Query: 1220 gaagtaaacctggaccgtta 1239 

ii i mi i inn nn 

Sbjct: 1238 gaagtaaatctggatagtta 1257 

Query: 1240 tcaaacagctttagaagaagtattatcgtggcttctttctgctgaggacaca^ 1299 

Mill II I I I I I I I I I I I I I I I I I I I I I I I II I I I II 1 I I I I III 

Sbjct: 1258 ccaaactgctttagaagaagtactttcatggcttctttctgccgaggatacattgcgagc 1317 

Query: 1300 acaaggagagatttctaatgatgtggaagtggtgaaagacc^ "59 

1 1 1 1 1 1 1 1 1 1 II 1 1 1 lllllill III llllllll 1 1 1 1 1 1 I ■ I llllllllll 
Sbjct: 131B acaaggagagatttcaaatgatgttgaagaagtgaaagaacagtttcatgctcatgaggg 1377 

Query: 1360 ^acatgatggatttgacagcccatcagggccgggtt^ 

Sbjct: 1378 aitcatgitggltcigacatctcatcaaggacttgttggtaatgttctacagttaggaag 1437 

Query: 1420 -agctgattggaacaggaaaattatcagaagatga^ 

Sbjct: 1438 tcaactagttggaaaagggaaattatcagaagatgaagaagctgaagtgcaagaacaaat 

Query; 1480 gaatctcctaaattcaagatgggaatgcctcagggtagcta^ 1539 
' || lllllll llUillll Mill III! I I I M I 11 M I M M L I 11 I I M II I M M 1 1 

Sbjct: 1498 gaatctcctaaattcaagatgggaatgtctcagggtagctagcatggaaaaacaaagcaa 1557 

ffl Query: 1540 tttacat agagttttaatggatctccagaatcagaaactgaaagagttgaatgac^ 1599 

n 1 1 1 1 1 i ii 1 1 iMiiiiiiiimminiii i inn i 1 | lfil7 

g Sbjct: 1558 attacac aaagttctaatggatctccagaatcagaaattaaaagaactagatgactggtt 1617 

W Query: 1600 aacaaaaacagaagaaagaacaaggaaaatggaggaagagc^ 1659 

ffi lllllllll Mill HIM I I 1 1 I I 11 I M I I I I I 1 1 I I I I I II Ml M MM I _ 

gj Sbjct: 1618 aacaaaaactgaagagagaactaagaaaatggaggaagagccctttggacctgatcttga 

■F Query: 1660 agacctaaaacgccaagtacaacaacataaggtgcttcaagaagatctagaacaagaaca 1719 

W y • iii 1 111 1 1 mi illinium ii ill 1 1 1 1 1 m 1 1 1 1 1 1 n 1 nn 

Sbjct: 1678 agatctaaaatgccaagtacaacaacataaggtgcttcaagaagatctagaacaggagca 17^ 
£ Query: 1720 agtcagggtcaattctctcactcacatggtggtggtagttgatgaatctagtgg^ 1779 

□ iiiiiniiii ii iimiiiiiini n ii milium n n 

□ Sbjct: 1738 ggtcagggtcaactcgctcactcacatggtagtagtggttgatgaatccagcggtgatca 1/9/ 

H Query: 1780 cgcaactgctgctttggaagaacaacttaaggtattgggagatcg^ 1639 

2 Mill . I 1 I I MM I M M 1 1 1 1 I M M I I I I I I II I M 11 I M I I I M I M I llj' «- 

B sbjct: 17 98 tgcaacagctgctttggaagaacaacttaaggtactgggagatcgatgggcaaatatctg 

Query: 1840 tagatggacagaagaccgctgggttcttttacaagacatc fttctcaaatggcaacgtct 1899 
• llllllll M I 1 1 1 1 1 1 M I II M M M M M M Mill Mllllll III 

Sbjct: 1858 cagatggactgaagaccgctggattgttttacaagatatt cttctaaaatggcagcattt 131/ 

Query: 1900 tactgaagaacagtgcctttttagtgcatggctttcagaaaaagaagatgcagtgaa^ 1959 

ii 1 1 ii iiiiiii niiii imin inn ii"'''! '''' '! 1(n7 

Sbjct: 1918 tactgaagaacagtgcctttttagtacatggctttcagaaaaagaagatgcaatgaagaa iy// 
Query: 1960 gattcacacaactggctttaaagatcaaaatgaaatgttatcaagtcttcaaaaactggc 2019 

inn nn mmiiiiniiinimnii i minimi ni i i 

Sbjct: 1978 cattcagacaagtggctttaaagatcaaaatgaaatgatgtcaagtcttcacaaaatatc 
Query: 2020 cgttttaaaagcggatctagaaaagaaaaagcaatccatgggcaaactgtattcactcaa 2079 

- imin iiiiiinmiiiiiiii i mm inn in mi in 

Sbjct: 2038 tactttaaaaatagatctagaaaagaaaaagccaaccatggaaaaactaagttcactcaa ^uy/ 
Query: 2080 • acaagatcttctttcaacactgaagaataagtcagtgacccagaagacggaagcatggct 2139 

MMMii urn mini imimimimm m mi nn m i 

Sbjct: 2098 tcaagatctactttcggcactgaaaaataagtcagtgactcaaaagatggaaatctggat ^id/ 

Query: 2140 ggataactttgcccggtgttgggataatttagtccaaaaacttgaaaagagtaca^ 2199 

Ml Ml I II It I M Mill MMM lllllll MM MMMII Ml 111 I 
Sbjct: 2158 ggaaaactttgcacaacgttgggacaatttaacccaaaaacttgaaaagagttcagcaca 
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Query: 
Sbjct: 
Query: 
Sbjct: 
Query: 
Sbjct: 
Query: 
Sbjct: 
Query: 
Sbjct: 



2200 
2218 
2260 
2278 
2320 
2338 
2380 
2398 
2440 
2458 



Query: 2500 
Sbjct: 2518 
Query': 2560 
Sbjct: 2578 
Query: 2620 
Sbjct: 2638 
Query: 2680 
Sbjct: 2698 
Query: 2740 
Sbjct: 2758 
Query: 2800 
Sbjct: 2818 
Query: 2860 
Sbjct: 2878 
Query: 2920 
Sbjct: 2938 
Query: 2980 
Sbjct: 2998 
Query: 3040 
Sbjct: 3058 
Query: 3100 
Sbjct: 3118 
Query: 3160 
Sbjct: 3178 



gatttcacaggct gtcaccaccactcagccatcactaacacagacaactgtaatgqaaac -2259 

minim Milium him i m 1 1 1 1 1 1 1 1 1 1 1 1 ri 1 1 1 1 1 1 1 1 

aatttcacaggct gtcaccaccactcaaccatccctaacacagacaactgtaatggaaac 2277 

agtaactacggtgaccacaagggaacagatcctggtaaagcatgctcaagaggaacttcc 2319 
J' Ml l f MMIMIIIIIIIIIII III Hillll Mill IIIIIIIIUIIII 
ggtaactatggtgaccacaagggaacaaatcatggtaaaacatgcccaagaggaacttcc 2337 

accaccacctccccaaaagaagaggcagattactgtggat tctgaaattaggaaaaggtt 2379 

liiiiiiniii miiiiiiimmi iiimin n iiiiiiiiii 

accaccacctcctcaaaagaagaggcagataactgtggat tctgaactcaggaaaaggtt 2397 

ggatgttgatataactgaacttcacagctggattactcgctcagaagctgtgttgcagag 2439 
Di ll Milllllllllllllllll llllllimi INIIIIHII || lim 
ggatgtcgatataactgaacttcacagttggattactcgttcagaagctgtattacagag 2457 

tcctgaatttgcaatctttcggaaggaaggcaacttctcagacttaaaagaaaaagtcaa 2499 
> IIMIMIlll Ml Ml II milllil IMIIMMI IIIMMIIMM 
ttctgaatttgcagtctatcgaaaagaaggcaacatctcagacttgcaagaaaaagtcaa 2517 

tgccatagagcgagaaaaagctgagaagttcagaaaactgcaagatgccagcagatcaqc 2559 
MINIM IIIIMlim IMIMMIMMMiMMIIMIIMIIMMMII 
rgccatagcacgagaaaaagcagagaagttcagaaaactgcaagatgccagcagatcagc 2577 

^???????! : ??^? ga3ca9atggtgaat 9 a ^ t g ttaat g ca gatagcatcaaacaagc 2619 

i'liiiiiiiimmMiMi MmiiMiimiM 1 1 11 mi mill 

tcaggccctggtggaacagatggcaaatgagggtgttaatgctgaaagtatcagacaagc 2637 

C ! : ??? aa ? aa ?! : ? aacagccggtggatcgaattct 5ccagttgctaagtgagagacttaa 2679 
INIIIMMIIIMIIIIMIMI Ml HIM lil|||||| | jt | 

ttcagaacaactgaacagccggtggacagaattctgccaattgctgagtgagagagttaa 2697 

ctggctggagtat cagaacaacatcatcgctttctataatcagctacaacaattggagca 2739 

ctaal LLlllll 11 ' MI "' !M 1 'I MMMMIMIIIIMMIIM II 
ctggctagagtat caaaccaacatcattaccttttataatcagctacaacaattggaaca 2757 

gatgacaactactgctgaaaactggttgaaaatccaacccaccaccccatcaga'gccaac 2799 
INIMMMHMI MINI! M I j 1 M ( III I |f||||| MMIIIIMM 

gatgacaactactgccgaaaacttgttgaaaacccagtctaccaccctatcagagccaac 2817 

????? ^ a fff gt ? ag ^ aaa f attt 5taaggatgaagtcaaccggctatcaggtcttca 2859 

iiUIU 1 ' 1 " "III" 1 MIMIIMIMMIII I I Mil HUM 

agcaattaaaagccagttaaaaatttgtaaggatgaagtcaacagattgtcagctcttca 2877 

a ??f? aaa M? aa ? 9 ?^ aaaaat ^ 2919 
M M i M M N I IIMIimill II II I I 1 1 1 1 1 J f | Mill II II 

gcctcaaattgagcaattaaaaattcagagtctacaactgaaagaaaagggacaggggcc 2937 

°? ??^ c ?! : ??^?? aga f" tgtggcctttacaa 

^IJ-Jl^^^ l I r11 ' 1 Illlllllllllll IIMIMIlll II Mill II 

aatgtttctggatgcagactttgtggcctttactaatcattttaaccacatctttgatgg 2997 

tgtgcaggccagagagaaagagctacagacaatttttgao actttgccaccaatgcgcta 3039 

111,1 IIMIMMMMIIIMIMIMIIM I I I 1 1 1 1 1 1| I M 1 1 1 1 I I 
tgtgagggccaaagagaaagagctacagacaatttttgac actttaccaccaatgcgcta 3057 

I'lllMII- MUM M I I 1 1 I I I Ml MMMIIIIMMI limilll I 
tcaggagacaatgagtagcatcaggacgtggatccagcagtcagaaagcaaactctctgt 3117 

acctcaacttagtgtcaccgactatgaa^ 3159 

accititcUiiU^ ' L ' ! 111,11 IIIIMMMI MM Mill 

accttatcttagtgttactgaatatgaaataatggaggagagactcgggaaattacaggc 3177 

tttacaaagttctctgcaagagcaacaaagtggcctatactatctcagcaccactgtgaa 3219 

tciaiUiiUUJ 1 1,11,1111111 i 111 " MIIMI II lllllllll 
tccgcaaagttctttgaaagagcaacaaaatggcttcaactatctgagtgacactgtgaa 3237 
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FIGURE 11 (cont. 
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Query: 3220 
Sbjct: 3238 
Query: 3280 
Sbjct: 3298 
Query: 3340 
Sbjct: 3358 
Query: 3400 
Sbjct: 3418 
Query: 34 60 
Sbjct: 3478 
Query: 3520 
Sbjct: 3538 
Query: 3580 
Sbjct: 3598 
Query: 3640 
Sbjct: 3658 
Query: . 3700 
Sbjct: 3718 
Query: 37 60 
Sbjct: 3778 
Query: 3820 
Sbjct; 3838 
Query: 3880 
• Sbjct: 3898 
Query: 3940 
Sbjct: 3958 
Query: 4000 
Sbjct: '4018 
Query: 4060 
Sbjct: 4078 
Query: 4120 
Sbjct: 4138 

Query: 4180 
Sbjct: 4198 



aqagatgtcgaagaaagcgccctctgaaattagccggaaatatcaatcagaatttgaaga 327 9 

iiiiii i 1 1 1 1 ii 1 1 ii ii Mill iii mum Minimum 

ggagatggccaagaaagcaccttcagaaatatgccagaaatatctgtcagaatttgaaga J^y / 

aattgagqgacgctggaagaagctctcctcccagctggttgagcattgtcaaaagctaga 3339 

MIMIM I I II 111 I I I II 1 1 I 1 1 1 I I 1 llll II. M I" 1111 '''! 
gattgaggggcactggaagaaactttcctcccagttggtggaaagctgccaaaagctaga JJ:>/ 

gqagcaa atgaataaactccgaaaaattcagaatcacatacaaaccctgaagaaatggat 3399 

iiii iiimiini iiiiii iiiimii.mil Mini mmmmi 

agaacat atgaataaacttcgaaaatttcagaatcacataaaaaccttacagaaatggat 34 l / 
qqctgaagttgatgtttttctgaaggaggaatggcctgcccttggggattcagaaattct 3459 

iiiiiniiiiiiiiiii inn iiiiiiiiiiiiiini nun i i hi i i' , 4T7 

ggctgaagttgatgttttcctgaaagaggaatggcctgccctgggggatgctgaaatcct 31 / f 

aaaaaagcagctgaaacagtgcagacttttagtcagtgatattcagacaattcagcccag 3519 

Mill Mill Mill MM MM I I I I I M M I I I I I I I I I I I I I I I 

gaaaaaacagctcaaacaatgcagacttttagttggtgatattcaaacaattcagcccag 353/ 

tctaaacaqtgtcaatgaaggtgggcagaagataaagaatgaagcagagccagagtttgc 3579 

I MM Mill IMIimitllMMMMMMI IIIIII II I HH"M 
tttaaatagtgttaatgaaggtgggcagaagataaagagtgaagctgaacttgagtttgc jos / 

ttcqaqacttqagacagaactcaaagaacttaacactcagtgggatcacatgtgccaaca 3639 

ll lllll MIMMIMM Ml MIMMMMMMMMMM MM M • 
atccagactggagacagaacttagagagcttaacactcagtgggatcacatatgccgcca Jbb/ 

• qgtctatgccagaaaggaggccttgaagggaggt ttggagaaaactgtaagcctccagaa 3699 

IIIIII IMIMIIII Mill MM MM Mill Mill IMIMMIM M 
ggtctacaccagaaaggaagccttaaaggcaggt ttggataaaaccgtaagcctccaaaa sni 

agatctatcagagatgcacgaatggatgacacaagctgaagaagagtatcttgagagaga 3759 

1 1 MM IMIIM II. I MIMIMM MUM IIIIII I Ml III I 

agatctatcagagatgcatgagtggatgacacaagctgaagaagaatatctagagagaga 3 / // 

ttttgaatataaaactccagatgaattacagaaagcagttgaagagatgaagagagctaa 3819 

I I iiiiii mm miMiM mini 1 1 11 iiiimi mmimmimm 

ttttgaatataaaactccagatgaattacagactgctgttgaagaaatgaagagagctaa 383/ 

aqaagaggcccaacaaaaagaagcgaaagtgaaactccttactgagtctgtaaatagtgt 3879 

MMMIM I MIMIMM I 1 1 1 1 1 1 II M i M 1 1 II II M 1 1 1 1 1 1 1 1 1 1 1 1 1 
agaagaggcactacaaaaagaaactaaagtgaaactccttactgagactgtaaatagtgt. 39 9/ 



catagctcaagctccacctgtagcacaagaggccttaaaaaaggaacttgaaactctaac 

limill MIMIM M 1 I I I I I I I I I I M I I ! I j I M M 1 I i M I I I I M II 
aatagctcacgctccaccctcagcacaagaggccttaaaaaaggaacttgaaactctgac 

caccaactaccagtggctctgcactaggctgaatgggaaatgcaagactttggaagaagt 

IMIIIMMM Mill Mill IMMMIMI MIMIM 11111111111111 
caccaactaccaatggctgtgcaccaggctgaatggaaaatgcaaaactttggaagaagt 

t tgggcatgttggcatgagttattgtcatacttggagaaagcaaacaagtggc.taaatga 

I I M I 1 I I I U I I I I I! I I I I M I I I I I M II ll li I I I I llll I MIM I, Mill 
t tgggcatgttggcatgagttattgtcatatttagagaaagcaaacaagtggctcaatga 



3939 
3957 
3999 
4017 
4059 
4077 
4119 



agtagaatttaaacttaaaaccactgaaaacattcctggcggagctgaggaaatctctga 

III Mill I II IIIIII III II Mill MIMI III I Mill HI M MM 
agtagaattgaaacttaaaaccatggaaaatgttcctgcaggacctgaggaaatcactga 4137 



qgtgctagattcacttgaaaatttgatgcgacattcagaggataacccaaatcagattcg 

II Ml II I II I llll III I MIMI MIMI Ml It Mill III IMIIM II 
agtgctagaatctcttgaaaatctgatgcatcattcagaggagaacccaaatcagattcg 



4179 



4197 



4239 



catattggcacagaccctaacagatggcggagtcatggatgagctaatcaatgaggaact 

MM III MIMI II MM I I II MM I Ml MM II II Ml I I'll H I i II 
tctattggcacagactcttacagatggaggagtcatggatgaactgatcaatgaggagct 4^57 



FIGURE 11 (cont. 



0 

□ 

Ul 

® 
m 

rjeu 
■ 

o 
o 
tn 



Query: 4240 
Sbjct: 4258 
Query; 4300 
Sbjct: 4318 
Query: 4360 
Sbjct: 4378 
Query: 4420 
Sbjct: 4 4 38 
Query: 4480 
Sbjct: 4498 
Query: 4540 
Sbjct: 4558 
Query: 4600 
Sbjct: 4618 
Query: 4660 
Sbjct: 4678 
Query: 4720 
Sbjct: 4738 
Query: 4780 
Sbjct: 4798 
Query: 4840 
Sbjct; 4858 
Query: 4900 
Sbjct: 4918 
Query: 4960 
Sbjct; 4978 
Query: 5020 
Sbjct: 5038 
Query: 5080 
Sbjct: 5098 
Query: 5140 
Sbjct: 5158. 



tgagacatttaattctcgttggagggaactacatgaagaggctgtaaggaggcaaaagtt 4299 

iiiiii '111111111111111111111 miiiiiiiii mi mmii 

tgagacgtttaattctcgttggagggaactacatgaagaggctgtgaggaaacaaaagtt 4J1 / 



qcttqaacaqaqc atccagtctgcccaggagactgaaaaatccttacacttaatccagga 

iminiiiii 1 1 1 1 1 ; 1 1 1 1 mi i n i iiiiii inn mum inn 

gcttgaacagagt 'atccagtctgcccaggaaattgaaaagtccttgcacttaattcagga 

gtccctcacattcattgacaagcagttggcagcttatattgcagacaaggtggacgcagc 

III II II I tl IMNI lllllllll! I i I MMII I lllllllllll inn 
gtcgcttgaattcattgacaagcagttggcagcttatatcactgacaaggtggatgcagc 

tcaaatgcctcaggaagcccagaaaatccaatctgatttgacaagtcatgagatcagttt 

I t I t 111 I I I I I I I III I III I I III I I I I I I I IM I I I I Ml I III I I I I I I I I I I I 
tcaaatgcctcaggaagcccagaaaatccaatcagatttgacaagtcatgagataagttt 

agaagaaatgaagaaacataatcaggggaaggaggctgcccaaagagtcctgtctcagat 

I I I Ml I I I 111 I I I I I III I I ill Ml I I I I II I I I III M II II M II 
agaagaaatgaagaaacataaccaggggaaggatgccaaccaaagggttctttcacaaat 

tgatgttgcacagaaaaa'attacaagatgtctccatgaagtttcgatta ttccagaaacc 

mi I ii i mi 1 1 1 1 iii iiimiimi mi i ii ii iiiiiiiii mil inn 

tgatgttgcacagaaaaaattacaagatgtctccatgaaatttcgatta ttccaaaaacc 



4359 
4377 



4419 



4437 
4479 
4497 
4539 
4557 
4599 



4617 



agccaattttgagctgcgtctacaagaaagtaagatgattttagatgaagtgaagatgca 4659 

llllllllllll I IIIIM I I I MM II I IMMI ! I I I Mill!! I I I I t I I I _ 
agccaattttgaacaacgtctagaggaaagtaagatgattttagatgaagtcaagatgca 4677 



cttgcctgcattggaaacaaagagtgtggaacaggaagtagtacagtcacagctaaatca 

I I II II I I III 111 I I I I I I I II IE llllllllllll I II M I I II I I I I III 
tttgcctgcattggaaaccaagagtgttgaacaggaagtaattcagtcacaactaagtca 

ttgtgtgaacttgtataaaagtctgagtgaagtgaagtctgaagtggaaatggtgataaa 

II I I I I I I I I I I I M I III I I I Ml I I I I I II M III I I I I I I Ml I I I Ml I I I II 
ttgtgtgaacttgtataaaagcctgagtgaagtcaagtctgaagtggaaatggtgattaa 



4719 
4737 



4779 



4797 



4839 



Query: 



5200 



Sbjct: 5218 



gactggacgtcagattgtacagaaaaagcagacggaaaatcccaaagaacttgatgaaag 

ii iii inn iiiiiii i iM iiiiii iii i mi mi urn miiim i 

aaccggacgtcaaattgtacagaaaaagcagacagaaaatcccaaagagcttgatgaacg 4857 

agtaacagctttgaaattgcattataatgagctgggagcaaaggtaacagaaagaaagca 4899 

III MMMMMIMMMI IM IIIIII MM II IMIMIMM llllllll 
agtaacagctttgaaattgcattacaatgagttgggtgcgaaggtaacagagagaaagca 4 917 

acagttggagaaatgo ttgaaattgtcccgtaagatgcgaaaggaaatgaatgtcttgac 4 959 

I M II I I II M I M I I lim lllllllllllllll IMIIIIimilMIIII II 
acagttggagaaatgc ttgaagttgtcccgtaagatgagaaaggaaatgaatgtcttaac 4977 

agaatggctggcagctacagatatggaattgacaaagagatcagcagttgaaggaatgcc 5019 

MMMMIIIMM IIIIIII MMIMI Mill IIIIII IIMI Ml I III Ml 
agaatggctggcagcaacagatacagaattgacgaagagatcagcagttgaaggaatgcc 5037 

tagtaattfcggattctgaagttgcctggggaaaggctactcaaaaagagattgagaaaca 5079 

I II II I I I II I I I I I I I M II I III I I II I I I I I I I I II Ml I I I I I III M I III Mi 
aagtaatttggattctgaagttgcctggggaaaggctactcaaaaagagattgagaaaca 5097 

gaaggtgcacctgaagagtatcacagaggtaggagaggccttgaaaacagttttgggcaa 5139 

Mill I I I I Ml I III I I I I I I llllllll I IIIIIII II MMIMI 
gaaggctcacttgaagagtgttacagaattaggagagtctttgaaaatggtgttgggcaa 5157 

gaaggagacgttggtggaagataaactcagtcttctgaatagtaactggatagctgtcac 5199 

III M II IIMI IIIIMimi milllllll Mill Mill M Mill III 
gaaagaaaccttggtagaagataaactgagtcttctgaacagtaactggatagctgtcac 5217 

ctcccgagcagaagagtggttaaatcttttgttggaatac cagaaacacatggaaacttt 5259 

MM IM IIIIII III M 1 M II 1 1 1 1 11 M M I M 11111111111111111 II 
ctccagagtagaagaatggctaaatcttttgttggaatac cagaaacacatggaaacctt 5277 



p 



Query: 

Sbjct : 

Query: 

Sbjct: 

Query: 

Sbjct: 

Query: 

Sbjct: 

Query: 

Sbjct: 

Query: 

Sbjct: 

Query: 

Sbjct: 



nil Query: 
Sbjct: 
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D 



Query: 
Sbjct: 
Query: 
Sbjct: 
Query: 
Sbjct: 
Query: 
Sbjct: 
Query: 
Sbjct: 
Query: 
Sbjct: 
Query: 
Sbjct: 
Query: 
Sbjct: 
Query: 
Sb j ct : 



5260 
5278 
5320 
5338 
5380 
5398 
5440 
5458 
5500 
5518 
5560 
5578 
5620 
5638 
5680 
5698 
5740 
5758 
5800 
5818 
5860 
5878 
5920 
5938 
5980 
5998 
6040 
6058 
6100 
6118 
6160 
6178 
6220 
6238 



FIGURE 11 (cont, 

tgaccagaatgtggaccacatcacaaagtggatcattc^ 5319 

mi iiiii til II 1 1 it 1 1 1 ii ii hi 1 1 1 N ii mill m M " 6 ™ 

[gitcajaaca.^ 5337 
atcagagaaaaagaaaccccagcaaaaagaagacgtgctt^ 5379 

ii ii ii 11111111 ii iiiii mm i inmniiiiimi in i ■ 

gtitiaaaiglaUiacwiaaMaaaggaagacattcttaagcgtttaaaggctgaaat 5397 
gaatgacatacgcccaaaggtggactctacacgtgaccaagcagcaaacttgatg^ 5439 

ii i mi i ii 1 1 1 1 1 1 1 1 1 T i ii 1 1 1 1 ii ii hum i mi urn mil min 

JaatJacaUJc^ 5457 
ccgcggtgaccactgcaggaaattagtagagccccaaatctcagagctcaaccatcga^ 5499 

MiiiiiiiiiiiiiMiiiii mm mmi mimm mm 5517 

ccgcggUacciclgcaggaaagtagt^ 5517 
tgcagccatttcacacagaattaagactggaaaggcctccattcctttg^ 9 aa "?9 B 5559 

iinii iiiii inMiiiimmmiimimmmmm "'i'l'i ,k ni 

igiagkittUt««iaUtaagactggaaaggcctccattcctttgaag gaattgga 5577 

gcagtttaactcagatatacaaaaattgcttgaacc^ ^19 
I I [ | | | | | | | | I | | | | I | I I I II I I I I I M I M I I I I I I t I I I I I I I I I I'll " 1 I I 'i 
gilUttiactiaiatataM 5637 

ggtgaatctgaaagaggaagacttcaataaagatatgaatgaagacaatgaggg^ 5679 

IIIMIllllllllllllllllilMIIIIIIIIIIII I S697 

ggtgaatctgaaagaggaagacttcaataaagatatgagtgaagacaatgagggtactgt 5697 

aaaagaattgttgcaaagaggagacaacttacaacaaagaatcacagatg^ 5739 
Ml | 1 1 1 1 1 M 1 1 M 1 1 1 1 M t M I M M M M 1 M M M 1 M M 1 1 1 1 1 1 1 M M I I 

iaitgaattgttgcaaagaw 575 

agaggaaataaagataaaacagcagctgttacagacaaaacataatgctctw 5799 

I I I I I I 1 I I I I I I I I I I I I I I I I I I I I 1 I I I M I I I I I I I I I I I 1 I I 1 " I 1 1 1 1 1 1 41 coi n 

agagUaaUalgataaaaca^ 5817 
gaggtctcaaagaagaaaaaaggctctagaaatt tctcatcagtggtatc^ 5859 

iiiiiniiimiiiimim himiiii mn iiiimiiiiiiiiiini 

iaggtitcaaagaagaaaaaaggccctagaaatt tctcaccagtggtatcagtacaagag 5877 

gcaggctgatgatctcctgaaatgcttggatgacattgnnnnnnnnttagccagcctac^ 5919 

lllllllllllllllllllilllllllllllll llll llllllllllllll 
gcaggctgatgaUtcctgaaatgcttgga 5937 

tgagcccagagatgaaaggaaaataaaggaaattgatcgggaattgcagaagaagaaaga 5979 

iii nniiiiiiini m miiiiimiim i m 1 1 1 1 1 1 m m 1 1 i i 

tgaacccagagatgaaagaaaattaaaggaaattgatcgtgaattgcagaagaa 5997 
ggagctgaatgcagtgcgtaggcaagctgagggcttgtctgaggat^ 6039 

iiiiimiiiiimii iiiiiiiiiiiiimmiiin in m'"'" , 0 c 7 

ggagctgaatgcagtgcgcaggcaagctgagggcttgtctgagaatggggccgcaatggc 6057 
agtggagccaactcagatccagctcagcaagcgctggcgggaaattgagagcaaatttgc 6099 

iiiiiii iiiiiMininiiiiiiiMim miimiim inn 

agtggagccaactcagatccagctcagcaagcgctggcggcaaattgagagcaattttgc bin 
tcagtttcgaagactcaactttgcacaa attcacactgtccgtgaagaaacgatgatggt 6159 

iiiiiiiiiiiimiiiiiiiiimi iimnii in mimii m i n 

tcagtttcgaagactcaactttgcacaa attcacactctccatgaagaaactatggtagt 6177 
gatgactgaagacatgcctttggaaatttcttatgtgccttctacttatttgactgaaat 6219 

ii iiiiinii minimi iiiiiiiniimin in n n 

gacgactgaagatatgcctttggatgtttcttatgtgccttctacttatttgaccgagat 6237 
cactcatgtctcacaagccctattagaagtggaacaacttctcaatgctcctgacctctg 6279 

n mi m min n i mm n n nm in mini nm 

cagtcatatcttacaagctctttcagaagttgatcatcttctaaatactcctgaactctg 6297 
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Query: 6280 
Sbjct: 6298 
Query: 6340 
Sbjct: 6358 
Query: 6400 
Sbjct: 6418 
Query: 6460 
Sbjct: 6478 
Query: 6520 
Sbjct: 6538 
Query: 6580 
Sbjct: 6598 
Query: 6640 
Sbjct: 6658 
Query: 6700 
Sbjct: 6718 
Query: 6760 
Sbjct: 6778 
Query: 6820 
Sbjct: 6838 
Query: 6880 
Sbjct: 6898 
Query: 6940 
Sbjct: 6952 
Query: 7000 
Sbjct: 7012 
Query: 7060 
Sbjct: 7072 
Query: 7120 
Sbjct: 7132 
Query: 7180 
Sbjct: 7192 
Query: 7240 
Sbjct: 7231 



FIGURE 11 (cont. 

tgctaaggactttgaagatctctttaagcaagaggagtctctgaag^ "3? 

Illlll II llllillllll tIMIIIIIII illl MilllllllllM I 

tgctaaagattttgaagatctttttaagcaagaggagtctcttaagaatataaaagacaa 6357. 

tctacaacaaagctcaggtcggattgacattattcatagcaa^ 6399 

i i mini 1 1 iiiiiiiiiiiii ii mm i > i u ' 1 1 > ' nmi mi 

tttgcaacaaatctcaggtcggattgatattattcacaagaagaagacagcagccttgca 6417 

aagtgcaacgcctgtggaaagggtgaagctacaggaagctctctcccagcttgatttcca 6459 

Illlll M I Illlll HUM IMIIMMI I IIMI minim 
aagtgccacctccatggaaaaggtgaaagtacaggaagccgtggcacagatggatttcca 64 / ? 



atgggaaaaagttaacaaaatgtacaaggaccgacaagggcgatttgacagatct gttga 6519 

.1111 Illlll I I I Ml 1 1 II M II MIMIMMMM 11111111 ' Mill 
gggggaaaaacttcatagaatgtacaaggaacgacaagggcgattcgacagatca gttga bi J / 

qaaatggcggcgttttcattatgatataaagatatttaatcagtggctaacagaagctga 6579 

iiiiiiiii tmiiiMMiM ill iiiiiiiiri urn i Jim ii . 

aaaatggcgacactttcattatgatatgaaggtatttaatcaatggctgaatgaagttga bi>y/ 
acactttctcagaaagacacaaattcctgagaattgggaacatgctaaatacaaatggta 6639 

mini in niiiiiiiii mm m i i i m i i i i i i i i i i m m n i i i 

acagtttttcaaaaagacacaaaatcctgaaaactgggaacatgctaaatacaaatggta bbb/ 

tcttaaaaaactccaqqatqqcattgggcagcggcaaactgttgtcagaacattgaatgc 6699 

I 1 M II I M 1 1 M 1 1 II Mi 1M 1 1 II III I M III limimiMM IMllll 
tcttaaggaactccaggatggcattgggcagcgtcaagctgttgtcagaacactgaatgc b/i/ 

aactqqqqaaqaaataattcagcaatcctcaaaaacagatgccagtattctacaggaaaa 6759 

ii ii I iiiiiMMi in mi ii ii iiiiiiiiiiiii ii i mm ii inn 

aactggggaagaaataattcaacagtcttcaaaaacagatgtcaatattctacaagaaaa b/ f / 
attqqqaaqcctgaatctgcggtggcaggaggtctgcaaacagctgtcagacagaaaaaa 6819 

iii iiiiii mi ii in ii iiiinii i i hi CQ „ 

attaggaagcttgagtctgcggtggcacgacatctgcaaagagctggcagaaaggagaaa bHJ / 
qaqqctaqaagaacaa aagaatatcttgtcagaatttcaaagagatttaaatgaatttgt 687 9 

nil i iiiiiiiii mm nm m 1 1 1 1 m i M 1 1 1 1 1 mi 1 1 11 1 cqoi 

gaggattgaagaacaa aagaatgtcttgtcagaatttcaaagagatttaaatgaatttgt btf 9 / 

tttatggttggaggaagcagataacattgctagtatcccacttgaacctggaaaagagca 6939 

III III III! 1111111111)11111111 II lllll Illl I nm 

tttgtggctggaagaagcagataacattgctattactccact tggagatgagca b9bi 

gcaactaaaagaaaagcttgagcaagtcaagttactggtggaagagttgcccctgcgcca 6999 

iii miimi iiiiiiiii limn 1 1 1 1 1 1 n 1 1 1 1 1 m 1 1 1 1 

gcagctaaaagaacaacttgaacaagtcaagttactggcagaagagttgcccctgcgcca /Oil 

gggaattctcaaacaattaaatgaaactggaggacccgtgcttgtaagtgctcccataag 7059 

lllllllll I 1 I I I M I 1 I j I I I M t Illlll I II I II II Illl llllillllll 
gggaattctaaaacaattaaatgaaacaggaggagcagtacttgtaagtgctcccataag 7071 

cccagaagagcaagataaacttgaaaataagctcaagcagacaaatctccagtggataaa 7119 

i ii iim mm m i! i M 1 1 1 mi mum i m i m 1 1 u 1 1 1 1 m 1 1 1 1 

gccagaagagcaagataaacttgaaaagaagctcaaacagacaaatctccagtggataaa 7131 

ggtttccagagctttacctgagaaacaaggagaaattgaagctcaaataaaagaccttgg 7179 

III 1 1 1 1 i I 1 1 1 1 1 1 MMMMIM Illl I III MIIMI II I 

ggtctccagagctttacctgagaaacaaggagagcttgaggttcacttaaaagattttag 7191 

gcagctt gnnnnnnngcttgaagaccttgaagagcagttaaatcatctgctgctgtggtt 7239 

Illl IIMIIIIMM I Illl lllll Illlll I 

gcag - cttgaagagcagctggatcacctgcttctgtggct 7230 

atctcctattaggaatcagttggaaatttataaccaaccaaaccaagaaggaccatttga 7299 

nnmim n miiiimmminniMi n i mm nm 

ctctcctattagaaaccagttggaaatttataaccaaccaagtcaggcaggaccgtttga 7290 



FIGURE 11 (cont. 



Query: 


7300 


Sbjct: 


7291 


Query: 


7360 


Sbjct : 


7351 


Query: 


7420 


Sbjct: 


7411 


Query: 


7480 


Sbjct: 


7471 


Query: 


7540 


Sbjct: 


7531 


Query: 


7600 


Sbjct: 


7591 


Query: 


7660 


Sbjct: 


7651 


Query: 


7720 


Sbjct: 


7711 


Query: ' 


7780 


Sbjct: 


7771 


Query: 


7840 


Sbjct: 


7831 


Query: 


7900 


Sbjct: 


7891 


Query: 


7960 


Sbjct: 


7951 


Query: 


8020 


Sbjct: 


8011 


Query: 


8080 


Sbjct: 


8071 


Query: 


8140 
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Query: 


8200 


Sbjct: 


8191 


Query: 


8260 


Sbjct: 


8251 



cgttcaggaaactgaaatagcagttcaagctaaacaaccggatgtggaagagattttgtc 7359 

II III! I 111! II III II I I III I II I II I II I I I I I I I I.I I ca 

cataaaggagattgaagta'acagttcacggtaaacaagcggatgtggaaaggcttttgtc 7350 

taaagggcagcatttgtacaaggaaaaaccagccactcagccagtgaagaggaagttaga 7419 

mmmimmiimm iiimiinii • 1 1 iii i ii 1 1 ii Minimi mil 

gaaagggcagcatttgtataaggaaaaaccaagcactcagccagtgaagaggaagttaga 7410 



agatctgagctctgagtggaaggcggtaaaccgtttacttcaagagctgagggcaaagca- 

llltlllll 1IIMIIII I I I I I I I M I I muni MINIMI lllllll 
agatctgaggtctgagtgggaggctgtaaaccatttacttcgggagctgaggacaaagca 



7479 
7470 



gcctgaccta gctcctggactgaccactattggagcctctcctactcagactgttactct 7539 

llllllll II IMMIIIII Mill MIIIIMM 'I I I I I I I I I I ! M M I I 
gcctgaccgt gcccctggactgagcactactggagcctctgccagtcagactgttactct 7530 

ggtgacacaacctgtggttactaaggaaactgccatctccaaactagaaatgccatcttc 7599 

II MM II I II I I I I I II I I I I I I I II II t I M I I I M I I III I I I II M I I I I ill - 
agtgacacaatctgtggttactaaggaaactgtcatctccaaactagaaatgccatcttc 7590 

cttgatgttgaag gtacctgctctggcagatttcaaccgggcttggacagaacttaccga 7659 

III llllllll I III LI! I llllllll Mill I II Ml I Mill II! MMI II 
tttgctgttggag gtacctgcactggcagacttcaaccgagcttggacagaacttacaga 7650 

* 

ctggctttctctgcttgatcaagttataaaatcacagagggtgatggtgggtgaccttga 7719 

MMM MMIIMIMM i M I I M I I I I M M 1 I I IMIIIMIIMM II II 
ctggctgtctctgcttgatcgagttataaaatcacagagagtgatggtgggtgatctgga 7710 

ggatatcaacgagatgatcatcaagcagaaggcaacaatgcaggatttggaacagaggcg 7779 

II IMM II I I I I II I II M MMIMIMM MM I 1 1 I M 1 II i I I M M . 
agacatcaatgaaatgatcatcaaacagaaggcaacactgcaagatttggaacagagacg 7770 

tccccagttggaagaactcattaccgctgcccaaaatttgaaaaacaagaccagcaatca 7839 

Mill IM MM MIIIIMM IMM III II I II II II M M I MIMMIMI 
cccccaattggaagaactcattactgctgcccagaatttgaaaaacaaaaccagcaatca 7830 

agaggctagaacaatcattacggatcgaattgaaagaattcagaatcagtgggatgaagt 7899 

III Mill IMMM II Ml II M I I I I II I M M i II M I I 11111111111111 
agaagctagaacaatcattactgatcgaattgaaagaattcagattcagtgggatgaggt 7890 

acaagaacaccttcagaaccggaggcaacagttgaatgaaatg ttaaaggattcaacaca 7959 

llllllll II MMM MM I ! M I M I I M I I i M I I I 1 1 1 1 ! 1 I I I I 1 I I J J I 
tcaagaacagctgcagaacaggagacaacagttgaatgaaatg ttaaaggattcaacaca 7950 

atggctggaagctaaggaagaagctgagcaggtcttaggacaggccagagcca-agcttga 8019 

I I II I II III i II II M I II I II I II MMM I I I I I I I I I Mill I I I I I I I I I 
atggctggaagctaaggaagaagccgaacaggtcataggacaggtcagaggcaagcttga 8010 

gtcatggaaggagggtccctatacagtagatgcaatccaaaagaaaatcacagaaaccaa 8079 

llllllll II Mill I MMMMII M M MMM II I IMIIIMIIMM 
ctcatggaaagaaggtcctcacacagtagatgcaatccaaaagaagatcacagaaaccaa 8070 

gcagttggccaaagacctccgccagtggcagacaaatgtagatgtggcaaatgacttggc 8139 

M I M Ml I I ! 1 1 M I! II II II MMM II MMM IMM II MM Mill 
gcagttggccaaagacctccgtcaacggcagataagtgtagacgtggcaaatgatttggc 8130 

cctgaaacttctccgggattattctgcagatgataccagaaaagtccacatgataacaga 8199 

MM lllllll Mill llllllll 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I II Ml MM II III 
actgaaacttcttcgggactattctgctgatgataccagaaaagtacacatgataacaga 8190 

gaatatcaatgcctcttggagaagcattcataaaagggtgagtgagcgagaggctgcttt 8259 
MMMMII I MINI III MMIMIMM M IMMM MIMMMMI 
gaatatcaatacttcttggggaaacattcataaaagagtaagtgagcaagaggctgcttt 8250 

ggaagaaact catagattactgcaacagttccccctggacctggaaaagtttcttgcctg 8319 

IMMIIIII MIIMMIMMI llllllll MIMMIMI MMMMl I I I I 
ggaagaaact catagattactgcagcagttccctctggacctggagaagtttctttcctg 8310 
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gcttacagaagctgaaacaactgccaatgtcctacaggat^ 

Mill Mill llllimillllllllHIIIIIII III llllllllll.l III 
gattacggaagcagaaacaactgccaatgtcctacaggacgcttcccgtaaggagaagct 8370 

cctagaagactccaagggagtaaaagagctgatgaaacaa^ 8439 

iimmmiimm nun i iiiiiiiiMiiu imiiiii mmmm I' 

wtagaagactccaggggagtcagagagctgatgaaaccatggca 8430 

aattgaagctcacacagatgtttatcacaacctggatgaaaacagccaaaaaatcctgag 8499 

I 1 1 1 1 I I I 1 1 1 1 I I 1 1 1 1 I IMItlM M llllllll Ml IMim 0iQn 

aatUaaactcacacagaiatctatcacaatcttgatgaaaatggcc 8490. 

atccctggaaggttccgatgatgcagtcctgttacaaa.gacgtttggataacatga^ 8559 

I"" 1 I H I II I M I Mill Ml I I Ml 111 M II I 1 I II I MM I ' t >' 8550 

atccctggaaggttcggatgaagcacccctgttacaaagacgtttggataacatgaattt a^u 

caagtggagtgaacttdggaaaaagtctctcaacattaggtcccatttggaagccagt tc 8619 

IMIIIMIMMMII IMIIIIMMMMMMMMMMIMIIIMI Ml M 
caagtggagtgaacttcagaaaaagtctctcaacattaggtcccatttggaagcaagt tc 861U 

tgaccagtggaagcgtctgcacctttctctgcaggaacttctggtgtggctacagctgaa. 8679 

llllllllllllllll III I llllllll M M M M M I II 1 1 I IH 1 1 Ml Mi 
tgaccagtggaagcgtttgcatctttctcttcaggaacttcttgtttggctacagctgaa 8670 

agatqatgaattaagccggcaggcacctattggaggcgactttccagcagttcagaagca 8739 
llllll I IMM MIMIM M II II M M MIMMMIIIMM 1 
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8730 



agatgatgaactgagccgtcaggcacccatcggtggtgatttcccagcagttcagaagca 

qaacgatgtacatagggccttcaagagggaattgaaaactaaagaacctgtaatcatgag 8799 

Ml III MMMMMMMMMMMMMMMMMIMMMMMMMMI 
gaatgatatacatagggccttcaagagggaattgaaaactaaagaacctgtaatcatgag b/yu 

tactcttgagactgtacgaatatttctgacagagcagcctttggaaggactagagaaact 8859 

nun iiiiiiM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ill liiiii I JL * i ' i ' ill ' X fi850 

tactctggagactgtgagaatatttctgacagagcagcctttggaaggactagagaaact ba^u 

ctaccaqqagcccagagagctgcctcctgaggagagagcccagaatgtcactcggcttct 8919 

llllll M I M I II II I M M HIM IMIMMMMIMM M . 

ctaccaggagcccagagaactgcctcctgaagaaagagctcagaatgtcactcggctcct tjyiu 

acgaaagcaggctgaggaggtcaatactgagtgggaaaaattgaacctgcactccgctga 8979 

IIIIIIIMIIIIII MIMIM MM Mill MMMMIMM III Mill 
acgaaagcaggctgaagaggtcaacgctgaatgggacaaattgaacctgcgctcagctga 89 /u 

ctqqcagagaaaaatagatgagacc ottgaaagactccaggaacttcaagaggccacgga 9039 

MIMI IIMM I M M I 1 1 M M II I I I I II 1 II I M M I M. 

ttggcagagaaaaatagatgaagct cttgaaagactccaggaacttcaggaagctgccga yuju 

tqaqctqgacctcaagctgcgccaagctgaggtgatcaagggatcctggcagcccgtggg 9099 

III MIIIMIIMI MMMMMMIMMIMMMMMMMMIM Mill 
tgaactggacctcaagttgcgccaagctgaggtgatcaagggatcctggcagccagtggg suyu 



9159 
9150 



cgatctcctcattgactctctccaagatcacctcgagaaagtcaaggcacttcgaggaga 

| I I I ! I II I M I M I I U 1 1 IMMIMIM M II III III II MM Ml Mill 
ggatctcctcattgactctctgcaagatcaccttgaaaaagtcaaggcacttcggggaga 

aattgcgcctctgaaagagaacgtgagccacgtcaatgaccttgctcgccagcttaccac 9219 

llllll Mill MMMM M I I MMMMIIMM I Mill Mill 
aattgcacctcttaaagagaatgtcaatcgtgtcaatgaccttgcacatcagctgaccac 



9210 



tttqqqcattcaqctctcaccgtataacctcagcactctggaagacctgaacaccagatg 

I II I 1 1 I II I M I II II I I MMMMIMIIM MMIM Mill MMMM 
actgggcattcagctctcaccttataacctcagcactttggaagatctgaataccagatg 

gaagcttctgcaggtggccgtcgaggaccgagtcaggcagctgcatgaa 

II MUM MMMM M MMMM Mill MIMMMMI 
gaggcttctacaggtggctgtggaggaccgtgtcagacagctgcatgaa 



9279 
9270 



FIGURE 12 (AR4-R23, SEQ ID NO: 39) 



QGGATTCCCTCACTTTCCCCCTACA.GGACTCAGATCTaGGAGGC7^TTACCTTCGGAGAAAAACQAATAGGA 
AAAACTGAAGTGTTACTTTTTTTAAAGCTGCTGAAGTTTGTTG^ 

C AAT AAAGTTTG AAG AAC TTTT AC C AGGTTTTTT T TAT C GC TG C C TTG AT AT ACA CTTTT C AAAATG C T T TO 
GTGGGAAG AAG TAGAGGACTCTT ATG AAAGAGAAG ATGT TC AAAAGAAAAC ATTC AC AAAATGGGT AAATGC 

ACAATTTTCTAAGTTTGGGAAGCAGCATATTOAGAACCT 
AGACCTCCTCGAAGGCCTOACAGGGCAAAAACTGCCAAAA^ 

C AATGTCAACAAGGC AC TGCG GGTTTTGC AGAACAATAATCTTG ATTTAG TGAATATTGGAAGTACTGAC AT 
CGTAGATGGAAATCATAAACT^ACTCTTGGTTTCATTTGGAATATAATCCTCCACTGGCAGGT 
AATGAAAAATATCATGGCTGGATTGCAACAAACCAACAGTGAAAAGATTCTCCTGAGCTGGGTCCGAC^ 
AACTCGTAATTATCCACAGGTTAATGTAATCAACTTCACCACCAGCTGGTCTGATGGCCTO 
TCTCATCCATAGTCAT AG GCCAGAC CTATTTG ACTGGAAT AGTGTGGTTTGCCAGCAGTC AG CC AC AC AACG 
ACTGG AACATGC ATTC AACATCG CCAGATATC AATTAGGCATAG AG AAAC T ACTCG ATCCTGAAG ATGTTGA 
TACC ACCT ATCCAGAT AAGAAGTCC ATCTTAATGTAC ATCACATCACTCT^C C AAGTTTTG CCTCAAC AAGT 
GAGCATTGAAGCCATCCAGGAAGTGGAAATGTTGCCAAGGCCACCTAAAGTGACTAAAGAAGAACATTrTC^ 
GTTACATCATCAAATGCACTAT^CTCAACAGATCACGGTCAGTCTAGCACAGGGATATGAGAGAACTTCTTC 
CCCTAAGCCTCGATTCAAGAGCTATGCCTACACACAGGCTGCTTATGTCACCACCTCTGACCCTACACGGAG 
CCCATTTCCTTCACAGCATTTGGAAGCTCCTGAAGACAAGTCATTTGGCAGTTCATTGATGGAGAGTOAAG 
„ AAACCTGaACCGTTATCAAACAGCTTTAGAAGAAGTATTATCGTGGCTTCTTTCTGCTGAGGAC^ 
^ AG CAC AAGG AG AGATTT CT AATG ATG TG G AAG TGGTG AAAG AC CAG TTT C AT ACT C ATG AG GG G T ACATG AT 

" GGATTTOACAGCCCATCAGGGCCGGGTTXK3TAATATTCTACAATTGGGAAQTAAGCTCATTC 
rd ATTATCAGAAGATGAAGAAACTGAAGTACAAGAGCAGATGAATCTCCTAAATTCAAGAT^ 

Uj QGTAGCTAGCATOGAAAAACAAAGCAATTTACATAGAGTTTTAATGGATOT 

0 GTTGAAaX}ACTGGCTAACAAAAACAGAAGAAAGAACAAGGAAAATGGAGGAAGAGCCTCTTGG^ 

03 TGAAGACCTAAAACGCCAAGTACAACAACATAAGGTGCTTCAAGAAGATCTAGAACAAGAAC 

£ CAATTCTCTCACTCACATGGTGGTC^TAGTTGATGAATCTAGTGGAGATCACGC^ 

ffl ACAACTTAAGGTATT<KK3AGATCGATGGGCAAACATCTGTAGATGGACAGAAGA^ 

AG ACATC CTTCTC AAATGGCAACGTCTTACTG AAGAACAGTGCCTTTIT AGTGC AT^ 

L AG ATG C AG TG AAC AAG ATTC AC AC AAC TGGCTTT AAAG ATC AAAATG AAATO TT AT C AAG TCTT CAAAAAC T 

JI GGCCGTTTTAAAAGC<3GATCTAGAAAAGAAAAAGCAATCCATGGGCAAACTGTATTCACT 

O TCITTCAACACTGAAGAATAAGTCAGTGACCCAGAAGACGGAAGCAIK}GCTG^ 

□ GGAT AATTTAGTCC AAAAACTTGAAAAG AGT AC AGCACAG ATTTC ACAGG CTGTCAC CACCACTC AG CCATC 

CH AC T AACACAG AC AACTG T AAT G G AAA CAGT AAC T ACG G TG AC C AC AAG G G AACAGAT C CTGGT AAAG C AT GC 

Q TCAAGAGGAACnTCCACCACCACCTCCCCAAAAGAAGAGGCAGATTACTGTGGATCT^ 

& AOTCAJWJAGGCCAa^TOAGOT 

CGTGGGCGATCTCCTCATTGACTCTCTCCAAGATCACCTCGAGAAAGTCAAGGCACITC 

GCCTCTGAAAGAGAACGTGAGCCACX3TCAATGACCTTCCTCGCCAGCTTACCACTTTGGGCATT 

ACCGTATAACCTCAGCACTCTGGAAGACCTGAACACCAGATGGAAGCTTCT^ 

AGTCAGGCAGCTGCATGAAGCCCACAGGGACTTTGGTCCA.GCATCT 

GGGTCCCTGGGAGAGAGCCATCTCGCCAAACAAAGTGCCCTACTATATCA^^ 

CTGGGACCATCCCAAAATCACAGAGCTCTACCAGTCTTTAGCTGACCTX3 

TAGGACTGCC ATG AAAC TCCGAAGACTO CAG AAGG CCCTTTGCTTGGAT 

TGATGCCTTGGACCAGCACAACCTCAAGCAAAATGAC(^GCCCATGGATATCCrrc 

GACC ACTATTTATGACCGCC TG GAG CAAGAG C ACAACAAT/TIX^TC AACGTC CCTCTCn^CGTGGATATGTG 

TCrTGAACTGGCK5CTGAAT<5TTTATGATACGGGAraAACAGGGAGG 

CATCATTTCCCTGTGTAAAGCACATTTGGAAGACAAGTACAGATACCTT^ 

AGGATTTTGTGACCAGCGCAGGCTGGGCCTCCTTCTC 

AGTTGCATCCTTTGGGGGCAGTAAC ATTG AG CCAAGTGTC CGGAG CTGCfTTCCAATTTGCT AATAATAAGCC. 

AGAGATCGAAGCGGCCCTCTTCCTAGACTGGATGAGACTGGAACCCC^^ 

GCACAOAGTGGCTGCTGCAGAAACTCCCAAGCATCAGGCCAAATGTAACATCTGCAAAGAGTG 

TGGATTCAGGTACAGGAGTCTAAAGCACTTTAATTATGACATCTO 
TGCAAAAGGCCATAAAATGCACTATCCCATGGTGGAATATTGC^ 
AGACTTTGCCJU^TACTAAAAAACAAATTTaSAACCAAAAG 
CCTGCCAGTGCAGACTGTCTTAGAGGGGGACAACATC 



FIGURE 12 (cont.) 



TGAT ACTC ATTCACGCATTGAAC ATTATG CT AGC AGG CT AGC^GAAATGGAAAAC AGCAATGG ATCTTATCT 

AAATGATAGCATCTCTCCTAATGAGAGCATAGATGATGAACATTTGTTAATCCAGCATTAC^ 

GAACCAGGACTCCCCCCTGAGCCAGCCTCGTAGTCCTGCCCAGATCTTGATTTCCTTAGAGAGTGAGGAAAG 

AGGGGAG CTAG AG AGAATCCTAGCAGATCTTGAG G AAG AAAACAGG AATCTG CAAG CAGAATATG ACCGTCT 
AAAGCAGCAGCACGAACATAAAGGCCTGTCCCCACTGCCGTCCCCTCCTGAAATGATGCCCACCTCTCCCCA 

GAGTCCCCGGG ATG CTGAGCTC ATTG CTG AGG CC AAGC T ACTGCGTCAAC AC AAAGGCCGCCTGGAAG C C AG 
GATGCAAATCCTGGAAGACCACAATAAACAGCTGGAGTCACAGTTACACAGGCTAAGGCAGCTGCTGGAGCA 
ACCCCAGGCAGAGGCCAAAGTGAATGGCACAACGGTGTCCTCTCCTTCTACCTCTCTACAGAGGTCCGACAG 

CAGTCAGCCTATGCTGCTCCGAGTGGTTGGCAGTCAAACTTCGGACTCCATGGGTGAGGAA 

TCC TCCCCAGGACAC AAGC AC AGGGTTAGAGG AGG TGATGG AGC AACTC AACAACT CCTTCCCTAGTTC AAG 

AGGAAGAAATACCCCTGGAAAGCCAATGAGAGAGGACACAATGTAGGAAGTCTTTTCCACATG GCAG ATGAT 

TTGGGCAGAGCGATGGAGTCCTTAGTATCAGTCATGACAGATGAAGAAGGAGCAGAATAAATGTTTTACAAC 

TCCTCATTCCCGCATGGTTTTTATAATATTCATACAACAAAGAGGATTAGACAGTAAGAGTTTACAAGAAAT 

AAATCTATATTTTTGTGAAGGGTAGTGGTATTATACTGTAGATTTC^^ 
TGTTAACAATGGCAGGTTTTACACGTCTATGCAATTGTACAAAAAAGT^ 

TTGATAGCTAAATAACTTGCCATTTCTTTATATGGAACGCATTTTGGGTTGTTTAAAAATTTATAACAGTTA 

TAAAGAAAGATTGTAAACTAAAGTGTGCTTTATAAAAAAAAGTTGTTTATAAAAACCCCT 

AAC AC AC ACACACACACAT AC AC ACACACACACAAAACTTTGAGGCAG CG C ATTGTTTTG CATCCTTTTGGC 

GTGATATCCATATGAAATTCATGGCTTTTTCTTTTTTTGC^ 

CCAAATGACTACTACACACTGCTCATTTGAGAACT 

ATATCTATATGTCTATAAGTATATAAATACTATAGTTATATAGATAAAGAGATACGAATTTCTATAGACTGA 
C T TTTTC CATTTTTT AAATG TT C ATGT C AC ATC C T AAT AGAAAG AAATT AC T T C T AGT C AGTCATC C AGG CT 
TACCTGCTTGGTCTAGA 



FIGURE 13 (AR2-R21, SEQ ID NO: 4 0) 



GGGATTCCCTCACTTTCCCCCTACAGGACTCAGATCTGWAGGCAATTACCTT^ 

caataaagtitoaagaacttttaccaggttttttt™ 
gtgggaagaagtagaggactgttatgaaagagaagatgttcaaaaga^ 

ACAATTTTCTAAGTTTGGGAAGCAGCATATTGAGAACCTCTTCAGTGACCTAC 

AGACCTCCTCGAAGGCCTGACAGGGCAAAAACTGCCAAAAGAAAAAGGATCCACAAGAGTTCATGCCCTCAA 

CAATGTCAACAAGGCACTGCGGGTTTTGCAGAACAATAATGTTGA 

CGTAGATGGAAATCATAAACTGACTCTTOTTTTGATTTGGAATATAATCCTCC^^ 

AATCAAAAATATCATGGCTGGATTGCAACAAACCAACAGTGAAAAGATTCT 

AACTCGTAATTATCCACAGGTTAATGTAATCAACT^ 

TCTCATCCATAGTCATAGGCCAGACCTATTTGACTGGAATAGTGTGGTTTGCCAGCAGTCAGCCACACAACG 
" ACTGG AACATG C ATTCAAC ATCGCCAGATATC AATT AGGCAT AG AGAAACT ACT CGATCCTGAAGATGTTGA 

TACCACCTATCCAGATAAGAA^^^ 

GAGCATTGAAGCCATCC^GGAAGTCGAAATGTTGCCAAGGCCACCTAAAGTGACTAAAGAAG 
GTTACATCATC AAATGCACT ATTCTCAAC AG ATCACGGTC AGTC TAG CACAGQG ATATGAG AGAACTTCTTC 
CCCTAAGCCTCGATTCAAGAGCTATGCCTACACACAGGCTGCTTATGTCACC^C 

CCCATTTCCTTCACAGCATTTGGAAGCrc 
« AAACCTGGACCGTTATCAAACAGCTTTAGAAGAAGTATTATCGTGGCTTCTTTCTQCTG^ 

£ AGCACAAGGAGAGATTTCTAATGATGTGGAAGTGGTGAAAGACCAGTTTCATACTCATO 

9 GGATTTGACAGCC CATC AGGG CCGGGTTGGTAATATTCTr AC AATTO^ 

fU. ATTATCAGAAGATGAAGAAACTGAAGTACAAGAGCAGATGAATCTCCTAAATTCAAGATG^ 

W GGTAGCTAGCATGGAAAAACAAAGCAATTTACATCATAGATTACTGCAACAGTTCCCCCTGGACCTGGAAAA 

03 GTTTCTTGC CT<3GCTTACAGAAGCTGAAACAACTGCCAATCTCCTACAGGATG 

EG C CT AG AAG AC TC C AAGGG AG T AAAAGAG CTG ATG AAACAATGG C AAG AC C T C C AAG GTG AAATTG AAG CT CA 

£ CAC AGATG TTT ATC ACAACCTGG ATGAAAACAGC C AAAAAATCCTG AGATC CCTG G AAGGTT CCGATG ATGC 

5 AG TC CTGTT AC AAAG A CG TTTGG AT AACATG AACTTC AAG TGGAG TG AACTT CGGAAAAAGT CT CT C AAC AT 

7 TAGGTCCCATTTCGAAGCC^GTTCTGAC 

L QCT ACAGCTGAAAGATGATGtfVATTAAGCCGGC AG GCACCT ATTGGAGG CGACTTTCCAGCAGTTC AG AAGCA 

E G AAC GAT G T ACAT AGGG C C TTC AAG AGGG AATTG AAAACT AAAG AAC C T<3 T AAT CATGAGT ACTC TTGAG AC 

S TGT ACG AAT ATTTCTGAC AGAGC AGCCTTTCG AAGG ACT AGAGAAACTCTACC AGGAG CCCAGAGAGCTG C 

Q TCCTCAC^AGAGAGCCCAGAATGTCACTCGGCTTCTACGAAAGCAGGCTOAGGAGGTCAATACT 

£D aa^ttgaaccTGCACTCCGCTQACTGGCAGAGAAAAATAGATCAGAC 

Q agaggccacggatgagctogacctcaagctgcgccaagctgaggtgatc^ 

Q CQATCTC ctcattgactctctccaagatcacctcgagaaagtcaaggcacttcgaggagaaa 

GAAAGAGAACGTGAGCCAC<3TCAATGACCTTGCTCGCCAGCTTACCACTTT^ 

taacctcagcactctggaagacctgaacaccagatggaagcttct^caggtc^ccgtcgaggaccgagtcag 

GCAGCTGCATGAAGCCCACAGGGACnTTGGTCCAGCATCTC 

CTXSGGAGAGAGCCATCTCGCCAAACAAAGTGCCCTACTATATCAACCACGAOACTCAA^ 

ccatcccaaaatgacagagctctaccagtctttagctgacct^ 

TGCCATGAAACTCCGAAGACTOCAGAAGGCCCTITGCT^ 

ctogaccagcacaacctcmgcaaaat<3accagcccatggatatcctgcaga™ 
tatttatgaccgcctggagcaagagcacaacaatttc^tcaacgtc^^ 

ctgg ctgc TGAATGTTT ATG AT AC g gg acg aacaggg AG gatc cgtg tc ctgt chttt aaaactgg catcat 

ttccctctgtaaaggacatttogaagacaagtac^^ 

ttgtoaccagcgcaggctgc^cctccttctgcatgattctatccaaattc 

ATCCTTTGOGGGC AGT AAC ATTG AGCC AAGTGTCCGG AGCTGCTTC CAATTTG CTAAT AATAAGCCAG AGAT 

cgaagcggcccrcttcctagactggatcagactggaacccc^ 
agtggctgctocagaaactcccaagcatc^c^ccaaatgtaacatcrrc 

cac^tacagc^gtctaaagcactttaattatgacatc^ 

AGGCCATAAAATGCACTATCCCATXKSTGGAATAT^ 

TG CCAAGGTACTAAAAAACAAATTTCG AACC AAAAGGTATTTT^ CATCCCCGAATGGGCT ACCTGCC 

AGTGCAGACTGTCTTAGAGGGXjGACAAC^TGGAAACGCCTCCCTCGTCCCCT 

TCATTC^CGCATTGAACATTATGCTAGCAGGCrrAGC^.GAAATGGAA 



FIGURE 13 (cont.) 



TAG CAT C TC TC C T AATG AG AGC AT AG ATG ATG AACATTTGTT AAT C C AG C ATT ACTG CCAAAGTTTG AAC CA 
(KJACT^CCCTOAGC^ 

G CTAGAG AG AATCCTAGCAGATCTTG AGGAAG AAAAC AGG AAT CTG C AAGC AG AAT ATG ACCGTCTAAAGC A 
GCAGCAOSAACATAAAGGCCTGTCCCCACra^ 

CCGGGATGCTGAGCTCATTGCTGAGGCCAAGCTACTGCGTCAACACAAAGGCCGCCTGGAAGCCAGGATGCA 
AATC CTGG AAGACCACAATAAAC AGCTGGAGTC AC AG TT ACAC AGGCTAAGGC AGC TG CTGGAGCAAC C CCA 
GGCAGAGGCCAAAGTGAATGGC71CAACGGTGTCCTCTCCTTCTACCTCTCTACAGAGGTCCGACAGCAGTCA 
GCCTATGCTGCTCCGAGTGGTTGGCAGTCAAACTTCGGACTCQATGGGTGAGGAAGATCTTCTCAGTCCTCC 
CCAGGACACAAGCACAGGGl^AGAGGAGGTGATGGAGCAACTCAACAACTCCTTCCCTAGTTCAAGAGGAAG 
AAAT ACC C CTGGAAAG C C AAT GAG AG AGG AC AC AATG T AGG AAGTCTTTT CC ACATGG CAGATG ATTTG GG C 
AGAGCG ATGGAGTCCTT AGT ATC AGTCATGAC AG ATGAAGAAGGAG CAG AAT AAATGTTTT ACAACTCC TGA 
TTCCCGCATGGTTTTTATAATATTCATACAACAAAGAGGATTAGACAGTAAGAGTTTACAAGAA^ 

ATATTTTTGTGAAGGGTAGTCGTATTA™ 

CAATGGCAGGTTI^ACACGTCrTATGCAATTGTACAAAAAAGTTATAAGAAAACTACATG 
GOT AAAT AACTTGCCATTTCTTTATATGGAACGCATTTTGGGTTGTT^ 

AAGATTGTAAACTAAAGTCTCCTTTATAAAAAAAAGTTGTTTATAAAAACCCCTAAAAACAAAACAAACACA 

CACACACACACATACACACACACACACAAAACTTTGAGGCAGCGCATTGTT^ 

TCCATATGAAATTCATGGCTTTrTCTTTTTTTGC^^ 
® GA cTACTACACACrocrCATTTGAGAACTGTCAGCTGAGTCGGGCAGG 

O ATATXSTCTATAAGTATATAAATACTATAGTTATATAGATAAAGAGATACGAATTTCTATAGACTGA^i-ri-iT 
nJ CCATTTTTT AAATGTT CATGTCACATCC T AATAGAAAG AAATT ACTTCTAGTC AGTC ATCC AGGCTTACCTG 

W CTTGGTCTAGA 

CO 

00 

® 

t 

□ 
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d 



FIGURE 14 (AR2-R21+H3, SEQ ID N0:41) 

' ========= 

A^^CGAAGGCCTGACAGC^CAAJ^^ 
CAATGTC^CAAGGC^ 

CGT AG ATGG AAATCATAAACTGACT CTTGGTTTGATTTGGAATAT AATCCTC CACTGG CAGGTCAAAAATGT 
AACTCGTAATTATCCACAGGTTAATGTAATCAACrrCACCACCAGCr^ 

TCTCATCCATAGTCATAGGCCAGACCTATTTGACTGGAATAGTGTGGTTTGCCAGCAGTCAGCCACACAACG 
ACTGGAACATGCATTCAACATCGCCAGATATCAATTAGGCATAGAGAAACTACTCGATCCTGAAGATGTTGA 

TACCACCTATCCAGATAAGAAGTCCATOT^ 

GAGCATTGAAGCCATCCAGGAAGTGGAAATGTTG^CAAGGCCACCTAAAGTC 

GTTACATC ATC AAATGC ACT ATTC TC AAC AG ATCACGGTC AGTCT AG CAC AGGGATATG AG AG AAC TTCTTC 
CCCTAAGCCTCGATTCAAaAGCTATGCCTACACACAGGCTGCTTATGTCACCACCTCTGACCCTACACGGAG 

CCCATTTCCTITCACAGCATTTGGAAGCTCCTGAAGACAAGTCATTTGGCAGTTCATTGATGG 

AAACCTGOACCGTTATCAAACAGCTTTAGAA^ 

« AGCACAAGG AG AGATTT CTAATG ATGTG GAAGTGGTG AAAG AC C AGTTTC AT ACTC ATG AGGGGT ACATGAT 

°- GG ATTTGACAG CCC ATC AGGG CCGGGTTGGTAATATTCTAC AATTGGG AAGTAAGCTGATTGG AAC AGG AAA 

ni ATTATCAGAAGATGAAGAAACTGAAGTACAAGAGCAGATGAATCTCCTAAATTCAAGATGGGAATGCCTCAG 

W GGTAG CT AGC ATGGAAAAACAAAGC AATTTACATGCTCCTrGGACTG ACC ACTATTGGAG CCTCTCCTACTC A 

BJ GACTGTT ACT C T G G TGAC ACAAC CTGT G GTT ACT AAGG AAAC T G CCAT CT C CAAACT AG AAATG C CATCTT C 

CO cnTGATGTTGGAGCATAGATTACTGCAACAGTTCCCCCTGGACCTGGAAAAGTT^ 

r AGCTGAAACAACTGCCAATGTCCTACAGGATGCTACCCGTAAGGAAAGGCTCCTAGAAGACTCCAAGGGAGT 

5 AAAAGAGCTGATGAAACAATGGCAAGACCTCCAAGGTGAAATTGAAGCTCACACAGATGTTTATCACAACCT 

' GGATGAAAACAGCCAAAAAATCCTGAGATCCCTGGAAGGTTCCGATGATGCAGTCCTGTTAC^ ■ 

L GGATAACATX3AACTTCAAGTGGAGTGAACTTCGGAAAAAGTCTCTCAACATTAGGTCCCA 

« TTCTGACCAGTCGAAGCGTCTGCACCTTTCTCTGCAGGAACTTCTGGTGTGGOT 

Q ATTAAGCCGGCAGGC^CCTATTGGAGGCGACTTTCCAGCAGTTCAGAAGCAGAACGATGTA(^TAGGGCCTT 

G CAAGAGGGAATTGAAAACTAAAGAACCTGTAATCATGAGTACTCTTGAGACTGTACGAATATTTOT 

W GCAGCCTTTGGAAGGACTAGAGAAACTCn'ACCAGGAGCCCAGAGAGCTGCCTCCTGAGGAGAGAGCCCAGAA 

□ TGTC ACTCGGCTTCTACG AAAG C AGGCTGAGGAGGTC AAT ACTG AGTGGGAAAAATTGAACC TG CACTC CGC 

Q TOACTGGCAGAGAAAAATAGATGAGACCCTTGAAAGACTCCAGGAACTTCAAGAGGCCACGGATGAGCTGGA 

CCTCAAGCTGCGCCAAGCTGAGGTGATCAAGGGATCCTGGCAGCCCGTGGGCGATCT 
CCAAQATCACCTC<»GAAAGTCAAiK^ 

CAATGACCTTGCTCGCCAGCTTACCACTTTGGGCATTCAGCTCTCACCGTATAACCTCAGCACT 
CCTGAJVCACC AJGATGG AAG CTTCTGC AGGTGG CCGTCGAGG ACCGAGTCAGGCAGCTG 
GGACTTTGGTCCAGCATCTCAX3CACTTTCTTTCCACGTCTOTCCAGGGTCCCTGGG 

AAACAAAGTGCCCTACTAT ATC AACCACG AG ACTC^AACAACTTGCTGGG ACCAT CT 
CTACCAGTCTTTAGCTGACCTGAATAATGTGAGATTCTCAGCTTATAG^ 

GCAGAAGGCCCTTTG CH!T^ ATCTCTTG AG CCTGTCAG C TG CATGTGATGCCTTGG ACCAGCACAACCTCAA 

GCAAAATGACCAGCCCATGGATATCCTGC^GATTATTAATTGT^ 

AGAGCACAAGAATTTGGTGAACGTCCCTCTCTGCG 

T ACGGGACGAAC AGGG AGG ATC CGTGTCCTGTCTTTTAAAACTGGC ATC ATTTCCCTGTGTAAAGCACATTT 

GG AAG AC AAGT ACAGATAC CTTTTC AAG C^^GTGGC AAGTTCAAC AGG ATTTTGTGACC AGCG CAGGCTGGG 

CCTCCTTCTGC^TGATTCTATCCAAATTCCAAGACAGTTGGGTGAAGTTOCATCCTTTC 

TGAGC C AAG TGT C C GGAG C TG CTTC CAAT T T G CT AAT AAT AAG C CAGAGATCG AAG CGGCC C T C TT C CT AG A 

C TG GATGAGACTGG AAC C C CAGT C CATGGTGTGG C TG C CCG TC C TGC ACAG AG TGGCIX^CTGC AG AAACTG C 

CAAGC ATCAGG CCAAATGT AACATCTGCAAAG AGTGT CCAATC ATTGGATTCAOT C A 

CTTTAATTATGACATCTGCCAAAGCTGCnTTTTTTCT^ 
CATGQTGGAATATTQCACTCCGACTACATCAGQAGAAGATGTTCGA6M 

ATTTCGAACCAAAAGGT ATTTTGCGAAGCATCCCCG AATGGGCT ACCTG CCAGTG CAGACTGTCTT AG AGGG 



FIGURE 14 (cont.) 



GGACAACATGGAAACGCCTGCCTCGTCCCCTCAGCTTTCACACGATGATACTCATTCACGCATTGA^ 
TGCTAGCAGGCTAGCAGAAATGGAAAACAGCAATGGATCTTATCTAAATGATAGCATCTCTCCTAATGAGAG 

c at ag atgatg aac atttgtt aatccagcattactg ccaaagtttgaaccaggactccc ccctga 
tcgtagtcctgcc cag atcttg atttcctt ag agagtg agg aaag agggg ag ct agag agaatcct agc ag a 
tcttgaggaagaaaacaggaatctgcaagcagaatatgaccgtctaaagcagcagcacgaacataaaggcct 
gtccccactgccgtcccctcctoaaatgatgcccacctctccccagagtccccgggatgctgagctcattgc 

tgaggccaagctactgcgtcaacacaaaggccgcctggaagccaggatgcaaatcctggaa^ 
acagctggagtcacagttacacaggctaaggcagctgctggagcaaccccaggcagaggccaaagtgaatgg 

cacaac^tgtcctctccttctacctctctacagaggtccgacagcagtcagcctatgctgctc 

tg g cagt caaactt c gg ac t c catgg gtg agg aagatctt ct c agt c ct c c c cagg ac ac aagc ac ag g g tt 

agaggaggtgatggagcaactcaacmctccttccctagttcaagaggaagaa^ 

gagagaggacacaatgtaggaagtcttttccacatggcagatgatttgggcaaagcgatggagtccttagta 

tcagtcatgacagatgaagaaggagcagaataaatgtt™ 
attcatacaacaaagaggattagacagtaagagtttacaagaaataaatctatattt^ 

gtattatactotagatttcagtagtttct^ 

t atg c aattgt ac aaaaaagtt at aag aaaact ac atg t aaaat c ttgat ag ct aaat aac ttgc cattt c t 

TTATATOGAACGCATTTTGGGTTGTTTAAAAATTTATAACAGTTATAAAGAAAGATTC 

m C TTT AT AAAAAAAAGTTG T T T AT AAAAAC C C C T AAAAAC AAAAC AAAC ACACAC ACACACAC AT AC AC AC AC 

^ ACACACAAAACTTTCAGGCAGCGCATTGTTITGCATCCTT1TGGC 

D TTTCTTTTTTTGCATATTAAAGATAAGACTTCCTCTACCACCACACCAAA 

fit TOAGAACTGTCAGCT^^GTGGGGCAGGCTTG 

W TACTATAGTTATATAGATAAAGAGATACGAATTTCTAT 

P3 CACATCCT AAT AGAAAGAAATTAC TTCT AG TCAGTC ATCCAGGCTT ACCTG CTTGGTCT AG A 

CD 
O 
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o 
□ 



FIGURE 15 (AH2-R19, SEQ ID NO: 42) 



gggattccct cactttcccc ctacaggact 
aaacgaatag gaaaaactga agtgttactt 
tcattgtttt taagcctact ggagcaataa 
atcgctgcct tgatatacac ttttcaaaat 
tgaaagagaa gatgttcaaa agaaaacatt 
gtttgggaag cagcatattg agaacctctt 
agacctcctc gaaggcctga cagggcaaaa- 
tcatgccctg aacaatgtca acaaggcact 
agtgaatatt ggaagtactg acatcgtaga 
ttggaatata atcctccact ggcaggtcaa 
gcaacaaacc aacagtgaaa agattctcct 
tccacaggtt aatgtaatca acttcaccac 
tctcatccat agtcataggc cagacctatt 
agccacacaa cgactggaac atgcattcaa 
actactcgat cctgaagatg ttgataccac 
catcacatca ctcttccaag ttttgcctca 
ggaaatgttg ccaaggccac ctaaagtgac 
aatgcactat tctcaacaga tcacggtcag 
ccctaagcct cgattcaaga gctatgccta 
ccctacacgg agcccatttc cttcaca'gca 
cagttcattg atggagagtg aagtaaacct 
attatcgtgg cttctttctg ctgaggacac 
tgtggaagtg gtgaaagacc agtttcatac 
ccatcagggc cgggttggta atattctaca 
attatcagaa gatgaagaaa ctgaagtaca 
ggaatgcctc agggtagcta gcatggaaaa 
tctccagaat cagaaactga aagagttgaa 
aaggaaaatg gaggaagagc ctcttggacc 
acaacataag gtgcttcaag aagatctaga 
tcacatggtg gtggtagttg atgaatctag 
acaacttaag gtattgggag atcgatgggc 
ggttctttta caagacatcc ttctcaaatg 
tagtgcatgg ctttcagaaa aagaagatgc 
agatcaaaat gaaatgttat caagtcttca 
aaagaaaaag caatccatgg gcaaactgta 
gaagaataag tcagtgaccc agaagacgga 
ggataattta gtccaaaaac ttgaaaagag 
cag cctgacctag ctcctggact 
gaccactatt ggagcctctc ctactcagac 
taaggaaact gccatctcca aactagaaat 
tctggcagat ttcaaccggg cttggacaga 
agttataaaa tcacagaggg tgatggtggg 
caagcagaag gcaacaatgc aggatttgga 
taccgctgcc caaaatttga aaaacaagac 
ggatcgaatt gaaagaattc agaatcagtg 
gaggcaacag ttgaatgaaa tgttaaagga 
agctgagcag gtcttaggac .aggccagagc 
tacagtagat gcaatccaaa agaaaatcac 
ccagtggcag acaaatgtag atgtggcaaa 
ttctgcagat gataccagaa aagtccacat 
aagcattcat aaaagggtga gtgagcgaga 
gcaacagttc cccctggacc tggaaaagtt 
tgccaatgtc ctacaggatg ctacccgtaa 
aaaagagctg atgaaacaat ggcaagacct 
ttatcacaac ctggatgaaa acagccaaaa 
tgcagtcctg ttacaaagac gtttggataa 
aaagtctctc aacattaggt cccatttgga 
cctttctctg caggaacttc tggtgtggct 
ggcacctatt ggaggcgact ttccagcagc 
caagagggaa ttgaaaacta aagaacctgt 
atttctgaca gagcagcctt tggaaggact 



cagatctggg aggcaattac cttcggagaa 
tttttaaagc tgctgaagtt tgttggtttc 
agtttgaaga acttttacca ggtttttttt 
gctttggtgg gaagaagtag aggactgtta 
cacaaaatgg gtaaatgcac aattttctaa 
cagtgaccta caggatggga ggcgcctcct 
actgccaaaa gaaaaaggat ccacaagagt 
gcgggttttg cagaacaata atgttgattt 
tggaaatcat aaactgactc ttggtttgat 
aaatgfcaatg aaaaatatca tggctggatt 
gagctgggtc cgacaatcaa ctcgtaatta 
cagctggtct gatggcctgg ctttgaatgc 
tgactggaat agtgtggttt gccagcagtc 
catcgccaga tatcaattag gcatagagaa 
ctatccagat aagaagtcca tcttaatgta 
acaagtgagc attgaagcca tccaggaagt 
taaagaagaa cattttcagt tacatcatca 
tctagcacag ggatatgaga gaacttcttc 
cacacaggct gcttatgtca ccacctctga 
tttggaagct cctgaagaca agtcatttgg. 
ggaccgttat caaacagctt tagaagaagt 
attgcaagca caaggagaga tttctaatga 
tcatgagggg tacatgatgg atttgacagc 
attgggaagt aagctgattg gaacaggaaa 
agagcagatg aatctcctaa attcaagatg 
acaaagcaat ttacatagag ttttaatgga * 
tgactggcta acaaaaacag aagaaagaac 
tgatcttgaa gacctaaaac gccaagtaca 
acaagaacaa gtcagggtca attctctcac 
tggagatcac gcaactgctg ctttggaaga 
aaacatctgt agatggacag aagaccgctg 
gcaacgtctt actgaagaac agtgcctttt 
agtgaacaag attcacacaa ctggctttaa 
aaaactggcc gttttaaaag cggatctaga 
ttcactcaaa caagatcttc tttcaacact 
agcatggctg gataactttg cccggtgttg' 
tacagcacag atttcacag 

tgttactctg gtgacacaac ctgtggttac 
gccatcttcc ttgatgttgg aggtacctgc 
acttaccgac tggctttctc tgcttgatca 
tgaccttgag gatatcaacg agatgatcat 
acagaggcgt ccccagttgg aagaactcat 
cagcaatcaa gaggctagaa caatcattac 
ggatgaagta caagaacacc ttcagaaccg 
ttcaacacaa tggctggaag ctaaggaaga 
caagcttgag tcatggaagg agggtcccta 
agaaaccaag cagttggcca aagacctccg 
tgacttggcc ctgaaacttc tccgggatta 
gataacagag aatatcaatg cctcttggag 
ggctgctttg gaagaaactc atagattact 
tcttgcctgg cttacagaag ctgaaacaac 
ggaaaggctc ctagaagact ccaagggagt 
ccaaggtgaa atcgaagctc acacagatgt 
aatcctgaga tccctggaag gttccgatga 
catgaacttc aagtggagtg aacttcggaa 
agccagttct gaccagtgga agcgtctgca 
acagctgaaa gatgatgaat taagccggca 
tcagaagcag aacgatgtac atagggcctt 
aatcatgagt actcttgaga ctgtacgaat 
agagaaactc taccaggagc ccagagagct 



FIGURE 15 (cont-) 



gcctcctgag gagagagccc agaatgtcac 
caatactgag tgggaaaaat tgaacctgca 
gacccttgaa agactccagg aacttcaaga 
ccaagctgag gtgatcaagg gatcctggca 
ccaagatcac ctcgagaaag tcaaggcact 
cgtgagccac gtcaatgacc ttgctcgcca 
gtataacctc agcactctgg aagacctgaa 
cgaggaccga gtcaggcagc tgcatgaagc 
ctttctttcc acgtctgtcc agggtccctg 
ctactatatc aaccacgaga ctcaaacaac 
ctaccagtct ttagctgacc tgaataatgt 
actccgaaga ctgcagaagg ccctttgctt 
tgccttggac cagcacaacc tcaagcaaaa 
taattgtttg accactattt atgaccgcct 
ccctctctgc gtggatatgt gtctgaactg 
agggaggatc cgtgtcctgt cttttaaaac 
ggaagacaag tacagatacc ttttcaagca 
gcgcaggctg ggcctccttc tgcatgattc 
tgcatccttt gggggcagta acattgagcc 
taataagcca gagatcgaag cggccctctt 
catggtgtgg ctgcccgtcc tgcacagagt 
caaatgtaac atctgcaaag agtgtccaat 
ctttaattat gacatctgcc aaagctgctt 
aatgcactat cccatggtgg aatattgcac 
ctttgccaag gtactaaaaa acaaatttcg 
aatgggctac c'tgccagtgc agactgtctt 
tctgatcaac ttctggccag tagattctgc 
tgatactcat tcacgcattg aacattatgc 
tggatcttat ctaaatgata gcatctctcc 
aatccagcat tactgccaaa gtttgaacca 
tgcccagatc ttgatttcct tagagagtga 
agatcttgag gaagaaaaca ggaatctgca 
cgaacataaa ggcctgtccc cactgccgtc 
gagtccccgg gatgctgagc tcattgctga 
cctggaagcc aggatgcaaa tcctggaaga 
caggctaagg cagctgctgg agcaacccca 
gtcctctcct tctacctctc tacagaggtc 
ggttggcagt caaacttcgg actccatggg 
cacaagcaca gggttagagg aggtgatgga 
aggaagaaat acccctggaa agccaatgag 
atggcagatg atttgggcag agcgatggag 
ggagcagaat aaatgtttta caactcctga 
acaaagagga ttagacagta agagtttaca 
gtggtattat actgtagatt tcagtagttt 
caggttttac acgtctatgc aattgtacaa 
ttgatagcta aataacttgc catttcttta 
ttataacagt tataaagaaa gattgtaaac 
ataaaaaccc ctaaaaacaa aacaaacaca 
actttgaggc agcgcattgt tttgcatcct 
ctttttcttt ttttgcatat taaagataag 
tacacactgc tcatttgaga actgtcagct 
atatctatat gtctataagt atataaatac 
tctatagact gactttttcc attttttaaa 
tacttctagt cagtcatcca ggcttacctg 
ggaagccagg aggaaactac accacactaa 
ttttaaacaa ctttccactg acaacgaaag 
aacatgtgaa tgaatacaca ggacttatta 
attgattgat tgattgatac attcagcttc 
aatgatgctt cagtggaaat caatcagaag 
ttttttaact cccaagcagt agcaggacga 
cccatccctg tgaaggagta ggccactctt 
acataaagtt ctctgtaatt acaactaaat 



tcggcttcta cgaaagcagg ctgaggaggt 
ctccgctgac tggcagagaa aaatagatga 
ggccacggat gagctggacc tcaagctgcg 
gcccgtgggc gatctcctca ttgactctct 
tcgaggagaa attgcgcctc tgaaagagaa 
gcttaccact ttgggcattc agctctcacc 
caccagatgg aagcttctgc aggtggccgt 
ccacagggac tttggtccag catctcagca 
ggagagagcc atctcgccaa acaaagtgcc 
ttgctgggac catcccaaaa tgacagagct 
cagattctca gcttatagga ctgccatgaa 
ggatctcttg agcctgtcag ctgcatgtga 
tgaccagccc atggatatcc tgcagattat 
ggagcaagag cacaacaatt tggtcaacgt 
gctgctgaat gtttatgata cgggacgaac 
tggcatcatt tccctgtgta aagcacattt 
agtggcaagt tcaacaggat tttgtgacca 
tatccaaatt ccaagacagt tgggtgaagt 
aagtgtccgg agctgcttcc aatttgctaa 
cctagactgg atgagactgg aaccccagtc 
ggctgctgca gaaactgcca agcatcaggc 
cattggattc aggtacagga gtctaaagca 
tttttctggt cgagttgcaa aaggccataa 
tccgactaca tcaggagaag atgttcgaga 
aaccaaaagg tattttgcga agcat.ccccg 
agagggggac aacatggaaa ctcccgttac 
gcctgcctcg tcccctcagc tttcacacga 
tagcaggcta gcagaaatgg aaaacagcaa 
taatgagagc atagatgatg aacatttgtt 
ggactccccc ctgagccagc ctcgtagtcc 
ggaaagaggg gagctagaga gaatcctagc 
agcagaatat gaccgtctaa agcagcagca 
ccctcctgaa atgatgccca cctctcccca 
ggccaagcta ctgcgtcaac acaaaggccg 
ccacaataaa cagctggagt cacagttaca 
ggcagaggcc aaagtgaatg gcacaacggt 
cgacagcagt cagcctatgc tgctccgagt 
tgaggaagat cttctcagtc ctcccicagga 
gcaactcaac aactccttcc ctagttcaag 
agaggacaca atgtaggaag tcttttccac 
tccttagtat cagtcatgac agatgaagaa 
ttcccgcatg gtttttataa tattcataca 
agaaataaat ctatattttt gtgaagggta 
ctaagtctgt tattgttttg ttaacaatgg 
aaaagttata agaaaactac atgtaaaatc 
tatggaacgc attttgg^tt gtttaaaaat 
taaagtgtgc tttataaaaa aaagttgttt 
cacacacaca catacacaca cacacacaaa 
tttggcgtga tatccatatg aaattcatgg 
acttcctcta ccaccacacc aaatgactac 
gagtggggca ggcttgagtt ttcatttcat 
tatagttata tagataaaga gatacgaatt 
tgttcatgtc acatcctaat agaaagaaat 
cttggtctag aatggatttt tcccggagcc 
aacattgtct acagctccag atgtttctca 
taaagtaaag tattggat.tt ttttaaaggg 
tatcagagtg agtaatcggt tggttggttg 
ctgctgctag caatgccacg atttagattt 
gtattctgac cttgtgaaca tcagaaggta 
tgatagggct ggagggctat ggattcccag 
taagtgaagg attggatgat tgttcataat 
tattatgccc tcttctcaca gtcaaaagga 



FIGURE 15 (cont.) 



actgggtggt ttggtttttg ttgctttttt 
taaatgccac aagacataat ttaaaataaa 
ccccatcaca tttgtgatac tgacaggtat 
tcctttgcat ttctctgcga gtagttccac 
aaattgattc aaatgttaca aaaaaaccct 
aacaaacaaa caaaaattgc tcaaaaaaga 
ctggtaggaa aaagctttac tctttcatgc 
tcattcaata gataccaccg tgtgacctat 
agtgtaatta gcttttggag agtgggctga 
gttttgtcca ttattaataa ttaattaatt 
.tacctcactt tggttttggg gtgttcctga 
ccacttgtcc attgcgttat tttctttttc 
tttcaaaaga aaacccaaag ctctaaggta 
ttttgtcttg catttttttc ctttatgtga 
taaaactcag atttaaaaca aggggttact 
agtttcattc taaaatcaga ggtaaataga 
gtttttcttt tagacacatt agctctggag 
gagagcttta ttgctgcatt ttaagcataa 
tttataacca ccgagtatta aactgtaaat 
ggcatgtttt gtcattgttt tcaggtactg 
gttttaacac caacactgta acatttacga 
ttttcacaac atatcagact tcaccaaata 
ttactgtgta tctcaataaa gcacgcagtt 



agatttattg tcccatgtgg gatgagtttt 
taaactttgg gaaaaggtgt aagacagtag 
caacccagaa gcccatgaac tgtgtttcca 
acaggtttgt aagtaagtaa gaaagaaggc 
tcttggtgga ttagacaggt taaatatata 
ggagaaaagc tcaagaggaa aagctaagga 
cattttattt ctttttgatt tttaaatcat 
aattttgcaa atctgttacc tctgacatca 
catcaagtgt aattagcttt tggagagtgg 
aacatcaaac acggcttctc atgctatttc 
taattgtgca cacctgagtt cacagcttca 
ctttataatt ctttcttttt ccttcataat 
acaaattacc aaattacatg aagatttggt 
cgctggacct tttctttacc caaggatttt 
ttacatccta ctaagaagtt taagtaagta 
gtgcataaat aattttgttt taatcttttt 
tgagtctgtc ataatatttg aacaaaaatt 
ttaatttgga cattatttcg tgttgtgttc 
cataatgtaa ctgaagcata aacatcacat 
agttcttact tgagtatcat aatatattgt 
attatttttt taaacttcag ttttactgca 
tatgccttac tattgtatta tagtactgct 
atgttac 



FIGURE 16 (SEQ ID NO:87) 



Human skeletal muscle alpha actinin, complete cDNA sequence: 

Genbank accession # M 8 64 06; 4181 base pairs 

GGAACTCCGCTTCGCCCGAGACCCAGCGCCCAGGCGTC^ 

GTCACCCCGCGCCCGCCGCCCGCCGCCCGCCGCCTCCGTGGGTCCGT^ 

GCGTCCGAGCCCCTCGCGCCCCGCCGCAGCCCC^ 

GCCCGGCGTGCAGTACAACTACGTGTACGACGAGGATGAGTACATGATCCAGGAGGAGGAGT 

GGGACCGCGACCTGCTCCTGGACCCAGCCTGGGAGAAGCAGCAGAGGAAGACCTTCACTGCC 

TGGTGTAACTCCCACCTAAGGAAAGCCGGCACCCAGATTGAGAACATCGAGGAAGACITCAG 

GAATGGCCTTAAGCTCATGCTGCTTTTGGAAGTCATCTCAGGGGAAAGGCTGCCCAAAC 

CCGGGGAAAAATGCGGTTCCACAAAATTGCTAATGTCAACAAAGCTITGGATTACATAGCCA 

GCAAAGGGGTGAAACTGGTGTCCATCGGCGCTGAAGAAATTGTTGATGGCAATGTGAAAATG 

ACCCTGGGTATGATCTOGACCATCATCCITCGCTTT 

CATCTGCCAAAGAAGGTCTGCTGCTn , GGTGTCAGAGGAAAACTGCT(XTTATAGA^ 

ACATTCAGAACTTCCATACTAGCTGGAAAGATGGCCTTGGACTCTGTGCCCTX^ATCCACCG 

ACCGGCCTGACCTCATTGACTACTCAAAGCTrAACAAGGATGACCeCATAGGAAATATTAACC 

TGGCCATGGAAATCGCTGAGAAGCACCTGGATATTCCTAAAATGTTGGATGCTGAAGACATCG 

TGAACAC£CCTAAACCCGATGAAAGAGCCATCATGACGTACGTCT^ 

TTGCXJGGCGCCKjAGCAGGCCGAGACAGCGGCTAACAGGATATGT 

CAAGAGAATGAGAGGCTGATGGAAGAATATGAGAGGCTAG<XjAGTGAGCTTTTGGAATGGAT 

tcgtcgcacgatcccctggctggagaaccggactcccgagaagaccatgcaagccatgcaga 

agaagctggaggacttccgggattaccgccggaagcacaagccacccaaggtgcaggagaa 

atgccagctggagatcaacttcaacacgctgcagaccaagctgcggatcagcaaccgtcctg 

ccttcatgcxctccgagggcaagatggtgtcggatattgctggtgcctggcagaggctggagc 

aggctgagaagggttacgaggagtggttgctcaatgagattcggagactggagcgcttggaa 

cacctggctgagaagttcaggcagaaggcxtcaacgcacgagacttgggcttatggcaaa 

GCAGATCTTGCTGCAGAAGGATTACGAGTCGGCXjTCGCTGACAGAGGTGCGGGCTCTGCTGC 

ggaagcacgaggcgttcgagagcgacctggcagcgcaccaggaccgcgtggagcagatcgc 

AGCCATCGCGCAGGAGCTCAATGAACTGGACTATCACGACXjCTGTGAATGTCAATGATCXjGT 

gccagaaaatttgtgaccagtgggaccgactgggaacgcttactcagaagaggagagaagcc 

ctagagagaatggagaaattgctagaaacx:attgatcagcttcacctggagtttgccaagag 

ggctgctccmcaacaattggatggagggcgctatggaggat^ 

(xacagcatrgaggagatccagagtctgatcactgcgcatgagcagttcaaggccacgctgc 

ccgaggcggacggagagcggcagtccatcatggccatccagaacgaggtggagaaggtgatt 

cagagctacaacatcagaatcagctcaagcaacccgtacagcactgtcaccatggatg 

ccggaccaagtgggacaaggtgaagcaactcgtgcccatccgcgatcaatccctgcaggagg 

AGCTGGCTCGCCAGCATGCTAACGAGCGTCTGAGGCGCCAGTTTGCTGCCCAAGCCAATGCCA 

TTGGGCCCTGGATCCAGAACAAGATGGAGGAGATOCCCGGAGCTCCATCCAGATCACAG 

GCCCTGGAAGACCAGATGAACCAGCTGAAGCAGTATGAGCACAACATCATCAACTATAAGAA 

CAACATCGACAAGCTGGAGGGAGACCATCAGCTCATCCAGGAGGCCCTTGTCm 

AGCACACGAACTACACGATGGAGCACATTCGTGTTGGATGGGAGCTGCTGCTGACAACCATC 

GCCAGAACCATCAATGAGGTGGAGACTCAGATCCTGACGAGAGATGCGAAGGGCATCACCCA 

GGAGGAGATGAATGAGTTCAGAGCCTCCTITCAACCACTTTGACAGGAGGAAGAATGGCCT 

TGGATCATGAGGATTTCAGAGCCTGCCTGATTTCCATGGGTTATGACCTGGGTGAAG<X 

TTGCCCGCATTATGACCCTGGTAGATCCCAACG^ 

TOJACnTCATGACTAGAGAGACGGCTGACACCGACACTGCCGAGCAGGTCATC^ 
GGATCCTGGCTTCTGATAAGCCATACATCCTGGCGGAGGAGCTGCGTCGGGAG^ 

ATCAGGCCCAGTACTGCATCAAGAGGATGCCCGCCTACTCGGGCCCAGGCAGTGTGCCTGGTG 
CACTGGATTACCKrrGCGTTCTCTrCCGCA 

CTGTAATGA CTCA TC£CATCAGAATGCAATAAAAGCGGAAGTCACAGT^ 
TTTGACAAGCTTTATTAAGTTGAGAGAGAGAGAGGGGGAAAA 

CAGTAATTG<XXGC^ATATAACACGGCTAAAATGAAGTTTTTACAGTATATGACATAGTGC^ 



FIGURE 16 (cont.) 



TTCATAAATAGGTTTATTTCTGAGTTTTTAGCAA 

gattaaacagaacaaattacttgagtaataggaaattaggaggatctagggacagaaggaaa 

GTGAAAAATGTGAAAATACAAAATACCCAAGATTTAAGACCGGGGGGAAAAAACCACAAATT 

GGTAAATAAAGGTTTGCTATTTGTAAAAAATTTCATTTATCTCTAATATGC^ 

CCTAGGGGAGTATATTTGGGATTCTAATGTTITArmCATGCTrA 

ATCrTTCAAATGAACTTAATATTGTGAGATGGAACTGCCGGGGATTAAAAAGACTACCCAAAA 

GATTTTTGGCACTTACAATTTITAAAATAGTTTATGTCATCTCTrCATT 

GTCAACTCAGTCAGTGATrrnTGATGCTTCTCTTATCCTCCAGAATAGA 

TGGAAGTCAGTTTAATTGCCAGAGAGAAGGATGCAATC^CTAGGTGAAATGAGGTTTTTAGG 

ATT ATTT ATTG ATTCC A GGTTCCC ATGCTTTTTGTT AG A G CTT ATTA GTAC A GGT1CTCAA G A G 
ATGACCACATAAAAGTGCTCTGTTTATAAATAAGCAGGTTTCTGTAGTACTGACTGGTTCATC 

ACAAGGCAAGTCAGAAACCAGTATCCTTCTAG 

TTTTATGACCGGTTAAGGAGAAGCCAGAGTTAGAGTAGGAGAGGACTAATTCTCAGCAGCAG 
TGGAGGTCAGTTCTTTCTTTTGCGGAAGCTITACATATGTTTT 

ATTTTAGCTAGTGTGCGGTGTGTGTrCACCCCTGGGATTGGACAGTGTATCCTAACAAGTCCC 

ATGTCTGGTTCTXSTGTCTAAAGGCCTGCra^ 

CACATCTTGATCTGTTGAATGTrCATTCTTTCnTITrGCTCATACTC 

CCCTG'irmCCATCTTGTTGACAGCTTGTAGAGAATAAAGCAGGAATTC 



FIGURE 17 (16-repeat construct, SEQ ID NO: 44) (numbering 
corresponds to the numbering of human dystrophin, acc. no. M185330 



gggattccct cactttcccc ctacaggact cagatctggg aggcaattac cttcggagaa 
aaacgaatag gaaaaactga agtgttactt tttttaaagc tgctgaagtt tgttggtttc 
tcattgtttt taagcctact ggagcaataa agtttgaaga acttttacca ggtttttttt 
atcgctgcct tgatatacac ttttcaaaat gctttggtgg gaagaagtag aggactgtta 
tgaaagagaa gatgttcaaa agaaaacatt cacaaaatgg gtaaatgcac aattttctaa 
gtttgggaag cagcatattg agaacctctt cagtgaccta caggatggga ggcgcctcct 
agacctcctc gaaggcctga cagggcaaaa actgccaaaa gaaaaaggat ccacaagagt 
tcatgccctg aacaatgtca acaaggcact gcgggttttg cagaacaata atgttgattt 
agtgaatatt ggaagtactg acatcgtaga tggaaatcat aaactgactc ttggtttgat 
ttggaatata atcctccact ggcaggtcaa aaatgtaatg aaaaatatca tggctggatt 
gcaacaaacc aacagtgaaa agattctcct gagctgggtc cgacaatcaa ctcgtaatta 
tccacaggtt aatgtaatca acttcaccac cagctggtct gatggcctgg ctttgaatgc 
tctcatccat agtcataggc cagacctatt tgactggaat agtgtggttt gccagcagtc 
agccacacaa cgactggaac atgcattcaa catcgccaga tatcaattag gcatagagaa 
actactcgat cctgaagatg ttgataccac ctatccagat aagaagtcca tcttaatgta 
catcacatca ctcttccaag ttttgcctca- acaagtgagc attgaagcca tccaggaagt 
ggaaatgttg ccaaggccac ctaaagtgac taaagaagaa cattttcagt tacatcatca 
aatgcactat tctcaacaga tcacggtcag. tctagcacag ggatatgaga gaacttcttc 
ccctaagcct cgattcaaga gctatgccta cacacaggct gcttatgtca ccacctctga 
ccctacacgg agcccatttc cttcacagca tttggaagct cctgaagaca agtcatttgg 
cagttcattg atggagagtg aagtaaacct ggaccgttat caaacagctt tagaagaagt 
attatcgtgg cttctttctg ctgaggacac attgcaagca caaggagaga tttctaatga 
tgtggaagtg gtgaaagacc agtttcatac tcatgagggg tacatgatgg atttgacagc 
ccatcagggc cgggttggta atattctaca attgggaagt aagctgattg gaacaggaaa 
attatcagaa gatgaagaaa ctgaagtaca agagcagatg aatctcctaa attcaagatg 
ggaatgcctc agggtagcta gcatggaaaa acaaagcaat ttacatagag ttttaatgga 
tctccagaat cagaaactga aagagttgaa tgactggcta acaaaaacag aagaaagaac 
aaggaaaatg gaggaagagc ctcttggacc tgatcttgaa gacctaaaac gccaagtaca 
acaacataag gtgcttcaag aagatctaga acaagaacaa gtcagggtca afctctctcac 
tcacatggtg gtggtagttg atgaatctag tggagatcac gcaactgctg ctttggaaga 
acaacttaag gtattgggag atcgatgggc aaacatctgt agatggacag aagaccgctg 
ggttctttta caagacatcc ttctcaaatg gcaacgtctt actgaagaac agtgcctttt 
tagtgcatgg ctttcagaaa aagaagatgc agtgaacaag attcacacaa ctggctttaa 
agatcaaaat gaaatgttat- caagtcttca aaaactggcc gttttaaaag cggatctaga 
aaagaaaaag caatccatgg gcaaactgta ttcactcaaa caagatcttc tttcaacact 
gaagaataag tcagtgaccc agaagacgga agcatggctg gataactttg cccggtgttg 
ggataattta gtccaaaaac ttgaaaagag tacagcacag atttcacagg. ctgtcaccac 
cactcagcca tcactaacac agacaactgt aatggaaaca gtaactacgg tgaccacaag 
ggaacagatc ctggtaaagc atgctcaaga ggaacttcca ccaccacctc cccaaaagaa 
gaggcagatt actgtggatt ctgaaattag gaaaaggttg gatgttgata taactgaact 
tcacagctgg attactcgct cagaagctgt gttgcagagt cctgaatttg caatctttcg 
gaaggaaggc aacttctcag acttaaaaga aaaagtcaat gccatagagc gagaaaaagc 
tgagaagttc agaaaactgc aagatgccag cagatcagct caggccctgg tggaacagat 
ggtgaatgag ggtgttaatg cagatagcat caaacaagcc tcagaacaac tgaacagccg 
gtggatcgaa ttctgccagt tgctaagtga gagacttaac tggctggagt atcagaacaa 
catcatcgct ttctataatc agctacaaca attggagcag atgacaacta ctgctgaaaa 
ctggttgaaa atccaaccca ccaccccatc .agagccaaca gcaattaaaa gtcagttaaa 
aatttgtaag gatgaagtca accggctatc aggtcttcaa cctcaaattg aacgattaaa 
aattcaaagc -atagccctga aagagaaagg acaaggaccc atgttcctgg at'gcagactt 
tgtggccttt acaaatcatt ttaagcaagt cttttctgat gtgcaggcca gagagaaaga 
gctacagaca atttttgaca ctttgccacc aatgcgctat caggagacca tgagtgccat 
caggacatgg gtccagcagt cagaaaccaa actctccata cctcaactta gtgtcaccga 
ctatgaaatc atggagcaga gactcgggga attgcaggct ttacaaagtt ctctgcaaga 
gcaacaaagt ggcctatact atctcagcac cactgtgaaa gagatgtcga agaaagcgcc 
ctctgaaatt agccggaaat accaatcaga atttgaagaa attgagggac gctggaagaa 
gctctcctcc cagctggttg agcattgtca aaagctagag gagcaaatga ataaactccg 
aaaaattcag aatcacatac aaaccctgaa gaaatggatg gctgaagttg atgtttttct 
gaaggaggaa tggcctgccc ttggggattc agaaattcta aaaaagcagc tgaaacagtg 
cagactttta gtcagtgata ttcagacaat tcagcccagt ctaaacagtg tcaatgaagg 
tgggcagaag ataaagaatg aagcagagcc agagtttgct tcgagacttg agacagaact 
caaagaactt aacactcagt gggatcacat gtgccaacag gtctatgcca gaaaggaggc 



' :gure 17 (cont.) 

3661 cttgaaggga ggtttggaga aaactgtaag cctccagaaa gatctatcag agatgcacga 
3721 atggatgaca caagctgaag aagagtatct tgagagagat cttgaatata ^tccaga 
3781 tgaattacag aaagcagttg aagagatgaa gagagctaaa gaagaggccc aacaaaaaga 
3841 agcgaaagtg aaactcctta ctgagtctgt aaatagtgtc atagctcaag ctccacctgt 
3901 agcacaagag gccttaaaaa aggaacttga aactctaacc accaactacc agtggctctg 
3961 cactaggctg aatgggaaat gcaagacttt ggaagaa 
6512 tctgttgag aaatggcggc gttttcatta 

6541 tgatataaag atatttaatc agtggctaac agaagctgaa cagtttctca gaaagacaca 
6601 aattcctgag aattgggaac atgctaaata caaatggtat cttaaggaac tccaggatgg 
6661 cattgggcag cggcaaactg ttgtcagaac attgaatgca actggggaag aaataattca 
6721 gcaatcctca aaaacagatg ccagtattct acaggaaaaa ttgggaagcc tgaatctgcg 
6781 gtggcagqag gtctgcaaac agctgtcaga cagaaaaaag aggctagaag aacaaaagaa 
6841 tatcttgtca gaatttcaaa gagatttaaa tgaatttgtt ttatggttgg aggaagcaga 
6901 taacattgct agtatcccac ttgaacctgg aaaagagcag caactaaaag aaaagcttga 
6961 gcaagtcaag ttactggtgg aagagttgcc cctgcgccag ggaattctca aacaattaaa 
7021 tgaaactgga ggacccgtgc ttgtaagtgc tcccataagc ccagaagagc a^gataaact 
7081 tgaaaataag ctca'agcaga caaatctcca gtggataaag gtttccagag ctttacctga 
7141 gaaacaagga gaaattgaag ctcaaataaa agaccttggg cagcttgaaa aaaagcttga 
7201 agaccttgaa gagcagttaa atcatctgct gctgtggtta tctcctatta ggaatcagtt 
7261 qgaaatttat aaccaaccaa accaagaagg accatttgac gttcaggaaa ctgaaatagc 
7321 agttcaagct aaacaaccgg atgtggaaga gattttgtct aaagggcagc atttgtacaa 
7381 ggaaaaacca gccactcagc cagtgaagag gaagttagaa gatctgagct ctgagtggaa 
7441 ggcggtaaac cgtttacttc aagagctgag ggcaaagcag cctgacctag ctcctggact 
7501 gaccactatt ggagcctctc ctactcagac tgttactctg. gtgacacaac ctgtggttac 
7561 taaggaaact gccatctcca aactagaaat gccatcttcc ttgatgttgg aggtacctgc 
7621 tctggcagat ttcaaccggg cttggacaga acttaccgac tggctttctc tgcttgatca 
7681 agttataaaa tcacagaggg tgatggtggg tgaccttgag gatatcaacg agatgatcat 
7741 caagcagaag gcaacaatgc aggatttgga acagaggcgt ccccagttgg aagaactcat 
7801 taccgctgcc caaaatttga aaaacaagac cagcaatcaa gaggctagaa caatcattac 
7861 ggatcgaatt gaaagaattc agaatcagtg ggatgaagta caagaacacc ttcagaaccg 
7921 gaggcaacag ttgaatgaaa tgttaaagga ttcaacacaa tggctggaag ctaaggaaga . 
79B1 agctgagcag gtctta'ggac aggccagagc caagcttgag tcatggaagg agggtcccta 
8041 tacagtagat gcaatccaaa agaaaatcac agaaaccaag cagttggcca aagacctccg 
8101 ccagtggcag acaaatgtag atgtggcaaa tgacttggcc ctgaaacttc .tccgggatta 
8161 ttctgcagat gataccagaa aagtccacat gataacagag aatatcaatg cctcttggag 
8221 aagcattcat aaaagggtga gtgagcgaga ggctgctttg gaagaaactc atagattact 
8281 gcaacagttc cccctggacc tggaaaagtt tcttgcctgg cttacagaag ctgaaacaac 
8341 tgccaatgtc ctacaggatg ctacccgtaa ggaaaggctc ctagaagact ccaagggagt 
8401 aaaagagctg atgaaacaat ggcaagacct ccaaggtgaa attgaagctc acacagatgt 
8461 ttatcacaac ctggatgaaa acagccaaaa aatcctgaga tccctggaag gttccgatga 
8521 tgcagtcctg ttacaaagac gtttggataa catgaacttc aagtggagtg aacttcggaa 
8581 aaagtctctc aacattaggt cccatttgga agccagttct gaccagtgga agcgtctgca 
8641 cctttctctg caggaacttc tggtgtggct acagctgaaa gatgatgaat taagccggca 
8701 ggcacctatt ggaggcgact ttccagcagt tcagaagcag aacgatgtac atagggcctt 
8761 caagagggaa ttgaaaacta aagaacctgt aatcatgagt actcttgaga ctgtacgaat 
8821 atttctgaca gagcagcctt tggaaggact agagaaactc taccaggagc ccagagagct 
8881 gcctcctgag gagagagccc agaatgtcac tcggcttcta cgaaagcagg ctgaggaggt 
8941 caatactgag tgggaaaaat tgaacctgca ctccgctgac tggcagagaa aaatagatga 
9001 gacccttgaa agactccagg aacttcaaga ggccacggat gagctggacc tcaagctgcg 
9061 ccaagctgag gtgatcaagg gatcctggca gcccgtgggc gatctcctca .ttgactctct 
9121 ccaagatcac ctcgagaaag tca'aggcact tcgaggagaa attgcgcctc tgaaagagaa 
9181 cgtgagccac gtcaatgacc ttgctcgcca gcttaccact ttgggcattc agctctcacc 
9241 gtataacctc- agcactctgg aagacctgaa caccagatgg aagcttctgc aggtggccgt 
9301 cgaggaccga gtcaggcagc tgcatgaagc ccacagggac tttggtccag catctcagca 
9361 ctttctttcc acgtctgtcc agggtccctg ggagagagcc atctcgccaa acaaagtgcc 
9421 ctactatatc aaccacgaga ctcaaacaac ttgctgggac. catcccaaaa tgacagagct 
9481 ctaccagtct ttagctgacc tgaataatgt cagattctca gcttatagga ctgccatgaa 
9541 actccgaaga ctgcagaagg ccctttgctt ggatctcttg agcctgtcag ctgcatgtga 
9601 tgccttggac cagcacaacc tcaagcaaaa tgaccagccc atggatatcc ■ tgcagattat 
9661 taattgtttg accactattt atgaccgcct ggagcaagag cacaacaatt tggtcaacgt 
9721 ccctctctgc gtggatatgt gtctgaactg gctgctgaat gtttatgata cgggacgaac 
9781 agggaggatc cgtgtcctgt cttttaaaac tggcatcatt tccctgtgta aagcacattt 
9841 ggaagacaag tacagatacc ttttcaagca agtggcaagt tcaacaggat tttgtgacca 
9901 gcgcaggctg ggcctccttc tgcatgattc tatccaaatt ccaagacagt tgggtgaagt 
9961 tgcatccttt gggggcagta acattgagcc aagtgtccgg agctgcttcc aatttgctaa 
10021 taataagcca gagatcgaag cggccctctt cctagactgg atgagactgg aaccccagtc 
10081 catggtgtgg ctgcccgtcc tgcacagagt ggctgctgca gaaactgcca agcatcaggc 
10141 caaatgtaac atctgcaaag agtgtccaat cattggattc aggtacagga gtctaaagca 



FIGtT * 17 (cont.) 

10201 ctttaat.-c gacatctgcc aaagctgctt tttttct^t cgagttgcaa" aaggccataa 
10261 aatgcactat cccatggtgg aatattgcac tccgactaca tcaggagaag atgttcgaga 
10321 ctttgccaag gtactaaaaa acaaatttcg aaccaaaagg tattttgcga- agcatccccg 
10381 aatgggctac ctgccagtgc agactgtctt- agagggggae aacatggaaa ctcccgttac 
104 41 tctgatcaac ttctggccag tagattctgc gcctgcctcg tcccctcagc tttcacacga 
10501 tgatactcat tcacgcattg aacattatgc tagcaggcta gcagaaatgg aaaacagcaa 
10561 tggatcttat ctaaatgata gcatctctcc taatgagagc atagatgatg aacatttgtt 
10621 aatccagcat tactgccaaa gtttgaacca ggactccccc ctgagccagc ctcgtagtcc 
10681 tgcccagatc ttgatttcct tag'agagtga ggaaagaggg gagctagaga gaatcctagc 
10741 agatcttgag gaagaaaaca ggaatctgca agcagaatat gaccgtctaa agcagcagca 
10801 cgaacataaa ggcctgtccc cactgccgtc ccctcctgaa atgatgccca cctctcccca 
10861 gagtccccgg gatgctgagc tcattgctga ggccaagcta ctgcgtcaac acaaaggccg 
10921 cctggaagcc aggatgcaaa ' tcctggaaga ccacaataaa cagctggagt cacagttaca 
10981 caggctaagg cagctgctgg agcaacccca ggcagaggcc aaagtgaatg gcacaacggt 
11041 gtcctctcct tctacctctc tacagaggtc cgacagcagt .cagcctatgc tgctccgagt 
11101 ggttggcagt caaacttcgg actccatggg tgaggaagat cttctcagtc ctccccagga 
11161 cacaagcaca gggttagagg aggtgatgga gcaactcaac aactccttcc ctagttcaag 
11221 aggaagaaat acccct'ggaa agccaatgag agaggacaca atgtaggaag tcttttccac 
11281 atggcagatg atttgggcag agcgatggag tccttagtat cagtcatgac agatgaagaa 
11341 ggagcagaat aaatgtttta caactcctga ttcccgcatg gtttttataa tattcataca 
11401 acaaagagga ttagacagta agagtttaca agaaataaat ctatattttt gtgaagggta 
11461 gtggtattat actgtagatt tcagtagttt ctaagtctgt tattgttttg ttaacaatgg 
11521 caggttttac acgtctatgc aattgtacaa aaaagttata agaaaactac atgtaaaatc 
11581 ttgatagcta aataacttgc catttcttta tatggaacgc attttgggtt gtttaaaaat 
11641 ttataacagt tataaagaaa gattgtaaac taaagtgtgc tttataaaaa aaagttgttt 
11701 ataaaaaccc ctaaaaacaa aacaaacaca cacacacaca catacacaca cacacacaaa 
11761 actttgaggc agcgcattgt tttgcatcct tttggcgtga tatccatatg aaattcatgg 
11821 ctttttcttt ttttgcatat taaagataag acttcctcta ccaccacacc aaatgactac 
11881 tacacactgc tcatttgaga actgtcagct gagtggggca ggcttgagtt ttcatttcat 
11941 atatctatat gtctataagt atataaatac tatagttata tagataaaga gatacgaatt 
12001 tctatagact gactttttcc attttttaaa tgttcatgtc acatcctaat agaaagaaat 
12061 tacttctagt cagtcatcca ggcttacctg cttggtctag aatggatttt tcccggagcc 
12121 ggaagccagg aggaaactac accacactaa aacattgtct acagctccag atgtttctca 
12181 ttttaaacaa ctttccactg acaacgaaag taaagtaaag tattggattt ttttaaaggg 
12241 aacatgtgaa tgaatacaca ggacttatta tatcagagtg agtaatcggt tggttggttg 
12301 attgattgat tgattgatac attcagcttc ctgctgctag caatgccacg atttagattt 
12361 aatgatgctt cagtggaaat caatcagaag gtattctgac cttgtgaaca tcagaaggta 
12421 ttttttaact cccaagcagt agcaggacga tgatagggct ggagggctat ggattcccag 
12481 cccatccctg tgaaggagta ggccactctt taagtgaagg attggatgat tgttcataat 
12541 acataaagtt ctctgtaatt acaactaaat tattatgccc tcttctcaca gtcaaaagga 
12601 actgggtggt ttggtttttg ttgctttttt agatttattg tcccatgtgg gatgagtttt 
12661 taaatgccac aagacataat ttaaaataaa taaactttgg gaaaaggtgt. aagacagtag 
12721 ccccatcaca tttgtgatac tgacaggtat caacccagaa gcccatgaac tgtgtttcca 
12781 tcctttgcat ttctctgcga gtagttccac acaggtttgt aagtaagtaa gaaagaaggc 
12841 aaattgattc aaatgttaca aaaaaaccct tcttggtgga ttagacaggt taaatatata 
12901 aacaaacaaa caaaaattgc tcaaaaaaga ggagaaaagc tcaagaggaa aagctaagga 
12961 ctggtaggaa aaagctttac tctttcatgc cattttattt ctttttgatt tttaaatcat 
13021 tcattcaata gataccaccg tgtgacctat aattttgcaa atctgttacc tctgacatca 
13081 agtgtaatta gcttttggag agtgggctga catcaagtgt aattagcttt tggagagtgg 
13141 gttttgtcca ttattaataa ttaattaatt aacatcaaac acggcttctc atgctatttc 
13201 tacctcactt tggttttggg gtgttcctga taattgtgca cacctgagtt cacagcttca 
13261 ccacttgtcc attgcgttat tttctttttc ctttataatt ctttcttttt. ccttcataat 
13321 tttcaaaaga aaacccaaag ctctaaggta acaaattacc aaatcacatg aagatttggt 
13381 ttttgtcttg. catttttttc ctttatgtga cgctggacct tttctttacc caaggatttt 
13441 taaaactcag atttaaaaca aggggttact ttacatccta ctaagaagtt taagtaagta 
13501 agtttcattc taaaatcaga ggtaaataga gtgcataaat aattttgttt taatcttttt 
13561 gtttttcttt tagacacatt agctctggag tgagtctgtc ataatatttg aacaaaaatt 
13621 gagagcttta ttgctgcatt ttaagcataa ttaatttgga cattatttcg tgttgtgttc 
13681 tttataacca ccgagtatta aactgtaaat cataatgtaa ctgaagcata aacatcacat 
13741 ggcatgtttt gtcattgttt tcaggtactg agttcttact tgagtatcat aatatattgt 
13801 gttttaacac caacactgta acatttacga attatttttt taaacttcag ttttactgca 
13861 ttttcacaac atatcagact tcaccaaata tatgccttac tattgtatta tagtactgct 
13921 ttactgtgta tctcaataaa gcacgcagtt atgttac 



FIGURE 18 (WW domain, SEQ ID NO:45) 

9371 acgtctgtcc agggtccctg ggagagagcc atctcgccaa acaaagtgcc 

9421 ctactatatc aaccacgaga ctcaaacaac ttgctgggac catcccaaaa tgacagagct 

9481 ctac 
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FIGURE 20 




D Contractile properties of EDL, soleus, and diaphragm 

muscies m wild-type, mdx % and dystrophin A71-78 mice. Muscle 
mass and specific force for mdx (A) and A71-78 (B) muscles 
were charted as a percentage of wild-type values. Significant dif- 
ferences (P < 0.05) are marked with an asterisk (*). 



FIGURE 21 (pBSX sequence, SEQ ID NO:46) 



CTAAATTGTAAGCGTTAATATTTTGTTAAAATC 

GCCGAAATCGGCAAAATCCCl^ATAAATC^AAAaAATAGACCGAGATAGGGTTGAGTGTTQT^ 

AGAGTCCACTATTAAAGAACGTGGACTCCAACGTCAAAGGGCGAAAAACCGTCTATCAGGGCGATGGCCCACTACG 

TGAACCATCACCCTAATC^GTTTTTTGGGGTCGAGGTGCCGTAAAGCACTAAATCGGAACCCTAAAGGQAGCCCC 

CGATTTAGAGCTTGACGGGGAAAGCCGGCGAACGTGGCGAGAAAGGAAGC^AAGAAAGCGAAAGGAGC^ 

(MGCGCTGGCAAGTGTAGCGGTCACGCTGCGCGTAACCACCACACCCGCCGCGCTTAATG^ 

GTCCC ATTCG CCATTC AGGCTGCGCAACTGTTGGG AAGGG CG ATCGGTGCGGG CCTCTTCGCTATTACGCCAGCTG 

GCGAAAGGGGG ATGTGCTG C AAGG CG ATT AAGTTGGGTAACJGCCAGGGTTTTCCCAGTC ACG ACGTTGTAAAACG A 

CX3GCCAGTGAGCGCGCGTAATACGACTCACTATACXJGCGAATTO 

CGGCCGCTCTAGAACT AGTGG ATCCCCCGGG C TGC AGG AATTCG GCCGCC^AGGCCACGCX3TAAGCTT ATCG AT AC 
^ CGTCGACCTCGAGGGGGGGCCCGGTACCCAGCTm^ 

Qatggtcatagctgtttcctgtgtgaaattgttatccgct^ 

fU TGTAAAGCCTGGGGTGCCTAATG AGTG AGCT AACTC AC ATT AATTGCGTTGCG CTC ACTG CCCGC TTTCCAGTCGG 

Ugaaacctgtcgtgccagctgcattaatgaatcggcc^ 

^CGCTTCCTQ3CTCACTOACTCGCTGCGCrcGGTCGTTCXK3OT 

m ATACGGTTATCC^CAGAATCAGGGGATAACGCAGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAACC 
*T GT AAAAAGQ C CGCG TTG CTGG CGTTTTTCC AT AG GC TC CG CCCCC CTG A CG AG C ATC AC AAAAATCGACG CTC AAG 
it TCAGAGGTGGCGAAACCCX3ACAGGACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCC 

gttccg accctgc cgctt ac c gg at ac c tg tc cg cctttc tc c ctt cggg aagcgtgg cg ctttctc at ag ct c ac 
8 gctx3taggtatctcaqttcggtgt aggtcgttcgctcc aag ctgggctgtgtgcacgaaccc cccgttcagcc cg a 
^ ccgctgcgrcttatc(xjgtaactatcx3tcttgagtccaacccgg 
0 ac tggtaacagg attagc agagcg aggtatgt aggcx3gtgctac agagttcttgm 
Oa<^ctagaaggacagtatttogtatctgcgctctgctgaagc^ 
(jl atccggcaaacaaaccaccgctggtagcggtggtttttttgttt 
□ tctcaagaagatcctttgatcttttctacggggtc^ 
q tcatgagattatc aaaaagg atcttc acct ag atcctttt aaatt aaaaatg aagtt^ 

atatgagtaaacttggtctgacagttacgaatgcttaatcagtgaggcacct 

tcatccatagttgcctgactccccgtcgtgtagataactacgatacgg^ 

caatgataccgcgagacccacx3ctcaccggcrc(^gatttatc^ 

cagaagtggtcctgcaactttatccgcctccatccagtctattaattgttgcc^ 

ccagtt aatagtttg otc aacgttgttgc c attg ct acaggc atcg tg gtgtcacg ctcgtcgtttggt atgg c tt 

cattcagctccggttcccj^cgatcaaggcgagttacatgatcccccatc 

cggtcctccgatcgttgtcagaagtaagttggccgcagtgttatcactc^ 

cttactgtcatgccatccgtaagatgcttttctgtgactggtgagta 

tg<xxkx)accgagttgctcttgcccggcgtcaataot 

CATC^TTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACC^ 
ACTCGTGCACCCAACTGATCTTCAGCATCriTT^ 

: ATGCCG CAAAAAAGG GAAT AAGGG CGACACGGAAATG TTG AATACTCAT ACT CTTC CTTTTTCAATATT ATTG AAG 
CATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATCTATTTAGAAA 
: CGCACATTTCCCCGAAAAGTGCCAC 
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FIGURE 23 ("full-length 1 HDMD, 3EQ ID NO: 47) 

-numbering corresponds to human dystrophin SEQ ID N0:1 

1 gggattccct cactttcccc ctacaggact cagatctggg aggcaattac cttcggagaa 
61 aaacgaatag gaaaaactga agtgttactt tttttaaagc tgctgaagtt tgttggtttc 
121 tcattgtttt taagcctact ggagcaataa agtttgaaga acttttacca ggtttttttt 
181 atcgctgcct tgatatacac ttttcaaaat gctttggtgg gaagaagtag aggactgtta 
241 tgaaagagaa gatgttcaaa agaaaacatt cacaaaatgg gtaaatgcac aattttctaa 
301 gtttgggaag cagcatattg agaacctctt cagtgaccta caggatggga ggcgcctcct 
361 agacctcctc gaaggcctga cagggcaaaa actgccaaaa gaaaaaggat ccacaagagt ■ 
421 tcatgccctg aacaatgtca acaaggcact gcgggttttg cagaacaata atgttgattt 
481 agtgaatatt ggaagtactg acatcgtaga tggaaatcat aaactgactc ttggtttgat 
541 ttggaatata atcctccact ggcaggtcaa aaatgtaatg aaaaatatca tggctggatt 
601 gcaacaaacc aacagtgaaa agattctcct gagctgggtc cgacaatcaa ctcgtaatta 
661 tccacaggtt aatgtaatca acttcaccac cagctggtct gatggcctgg ctttgaatgc 
721 tctcatccat agtcataggc cagacctatt tgactggaat agtgtggttt gccagcagtc 
781 agccacacaa cgactggaac atgcattcaa catcgccaga tatcaattag gcatagagaa 
841 actactcgat cctgaagatg ttgataccac ctatccagat aagaagtcca tcttaatgta 
901 catcacatca ctcttccaag . ttttgcctca acaagtgagc attgaagcca tccaggaagt 
961 ggaaatgttg ccaaggccac ctaaagtgac taaagaagaa cattttcagt tacatcatca 
1021 aatgcactat tctcaacaga tcacggtcag tctagcacag ggatatgaga gaacttcttc 
1081 ccctaagcct cgattcaaga gctatgccta cacacaggct gcttatgtca ccacctctga 
1141 ccctacacgg agcccatttc cttcacagca tttggaagct cctgaagaca agtcatttgg 
1201 cagttcattg atggagagtg aagtaaacct ggaccgttat caaacagctt tagaagaagt 
1261 attatcgtgg cttctttctg ctgaggacac attgcaagca caaggagaga tttctaatga 
1321 tgtggaagtg gtgaaagacc agtttcatac tcatgagggg tacatgatgg atttgacagc 
1381 ccatcagggc cgggttggta atattctaca attgggaagt aagctgattg gaacaggaaa 
1441 attatcagaa gatgaagaaa ctgaagtaca agagcagatg aatctcctaa attcaagatg 
1501 ggaatgcctc agggtagcta gcatggaaaa acaaagcaat ttacatagag ttttaatgga 
1561 tctccagaat cagaaactga aagagttgaa tgactggcta acaaaaacag aagaaagaac 
1621 aaggaaaatg gaggaagagc ctcttggacc tgatcttgaa gacctaaaac gccaagtaca 
1681 acaacataag gtgcttcaag aagatctaga acaagaacaa gtcagggtca attctctcac 
1741 tcacatggtg gtggtagttg atgaatctag tggagatcac gcaactgctg ctttggaaga 
1801 acaacttaag gtattgggag atcgatgggc aaacatctgt agatggacag aagaccgctg 
1861 ggttctttta caagacatcc ttctcaaatg gcaacgtctt actgaagaac agtgcctttt 
1921 tagtgcatgg ctttcagaaa aagaagatgc agtgaacaag attcacacaa ctggctttaa 
1981 agatcaaaat gaaatgttat caagtcttca aaaactggcc gttttaaaag cggatctaga 
2041 aaagaaaaag caatccatgg gcaaactgta ttcactcaaa caagatcttc tttcaacact 
2101 gaagaataag tcagtgaccc agaagacgga agcatggctg gataactttg cccggtgttg 
2161 ggataattta gtccaaaaac ttgaaaagag tacagcacag afcttcacagg ctgtcaccac 
2221 cactcagcca tcactaacac agacaactgt aatggaaaca gtaactacgg tgaccacaag 
2281 ggaacagatc ctggtaaagc atgctcaaga ggaacttcca ccaccacctc cccaaaagaa 
2341 gaggcagatt actgtggatt ctgaaattag gaaaaggttg gatgttgata taactgaact 
2401 tcacagctgg attactcgct cagaagctgt gttgcagagt cctgaatttg caatctttcg 
24 61 gaaggaaggc. aacttctcag acttaaaaga aaaagtcaat gccatagagc gagaaaaagc 
2521 tgagaagttc agaaaactgc aagatgccag cagatcagct caggccctgg tggaacagat 
2581 ggtgaatgag ggtgttaatg cagatagcat caaacaagcc tcagaacaac tgaacagccg 
2641 gtggatcgaa .ttctgccagt tgctaagtga gagacttaac tggctggagt atcagaacaa 
2701 catcatcgct ttctataatc agctacaaca attggagcag atgacaacta ctgctgaaaa 
2761 ctggttgaaa atccaaccca ccaccccatc agagccaaca gcaattaaaa gtcagttaaa 
2821 aatttgtaag gatgaagtca accggctatc aggtcttcaa cctcaaattg aacgattaaa 
2881 aattcaaagc atagccctga aagagaaagg acaaggaccc atgttcctgg atgcagactt 
2941 tgtggccttt acaaatcatt ttaagcaagt cttttctgat gtgcaggcca gagagaaaga 
3001 gctacagaca atttttgaca ctttgccacc aatgcgctat caggagacca tgagtgccat 
3061 caggacatgg gtccagcagt cagaaaccaa actctccata cctcaactta gtgtcaccga 
3121 ctatgaaatc atggagcaga gactcgggga attgcaggct ttacaaagtt ctctgcaaga 
3181 gcaacaaagt ggcctatact atctcagcac cactgtgaaa gagatgtcga agaaagcgcc 
3241 ctctgaaatt agccggaaat atcaatcaga atttgaagaa attgagggac gctggaagaa 
3301 gctctcctcc cagctggttg agcattgtca aaagctagag gagcaaatga ataaactccg 
3361 aaaaattcag aatcacatac aaaccctgaa gaaatggatg gctgaagttg atgtttttct 
3421 gaaggaggaa tggcctgccc ttggggattc agaaattcta aaaaagcagc tgaaacagtg 
3481 cagactttta gtcagtgata ttcagacaat tcagccca'gt ctaaacagtg tcaatgaagg 
3541 tgggcagaag ataaagaatg aagcagagcc agagtttgct tcgagacttg agacagaact 



FIGURE 23 (cont.) 



3601 caaagaactt aacactcagt gggatcacat gtgccaacag gtctatgcca gaaaggaggc 
3661 cttgaaggga ggtttggaga aaactgtaag cctccagaaa gatctatcag agatgcacga 
3721 atggatgaca caagctgaag aagagtatct tgagagagat tttgaatata aaactccaga 
3781 tgaattacag aaagcagttg aagagatgaa gagagctaaa gaagaggccc aacaaaaaga 
3841 agcgaaagtg aaactcctta ctgagtctgt aaatagtgtc atagctcaag ctccacctgt 
3901 agcacaagag gccttaaaaa aggaacttga aactctaacc accaactacc agtggctctg 
3961 cactaggctg aatgggaaat gca.agacttt ggaagaagtt tgggcatgtt ggcatgagtt 
4021 attgtcatac ttggagaaag caaacaagtg gctaaatgaa gtagaattta aacttaaaac 
4081 cactgaaaac attcctggcg gagctgagga aatctctgag gtgctagatt cacttgaaaa 
4141 tttgatgcga cattcagagg ataacccaaa tcagattcgc atattggcac agaccctaac 
4201 agatggcgga gtcatggatg agctaatcaa tgaggaactt gagacattta attctcgttg 
4261 gagggaacta catgaagagg ctgtaaggag gcaaaagttg cttgaacaga gcatccagtc 
4321 tgcccaggag actgaaaaat ccttacactt aatccaggag tccctcacat tcattgacaa 
4381 gcagttggca gcttatattg cagacaaggt ggacgcagct caaatgcctc aggaagccca 
4441 gaaaatccaa tctgattt'ga caagtcatga gatcagttta gaagaaatga agaaacataa 
4501 tcaggggaag gaggctgccc aaagagtcct gtctcagatt gatgttgcac agaaaaaatt 
4561 acaagatgtc tccatgaagt ttcgattatt ccagaaacca gccaattttg agctgcgtct 
4 621 acaagaaagt aagatgattt tagatgaagt gaagatgcac ttgcctgcat tggaaacaaa 
4681 gagtgtggaa caggaagtag tacagtcaca gctaaatcat tgtgtgaact tgtataaaag 
4741 tctgagtgaa gtgaagtctg aagtggaaat ggtgataaag actggacgtc agattgtaca 
4801 gaaaaagcag acggaaaatc ccaaagaact tgatgaaaga gtaacagctt tgaaattgca 
4861 ttataatgag ctgggagcaa aggtaacaga aagaaagcaa cagttggaga aatgcttgaa 
4921 attgtcccgt aagatgcgaa aggaaatgaa tgtcttgaca gaatggctgg cagctacaga 
4 981 tatggaattg acaaagagat cagcagttga aggaatgcct agtaatttgg attctgaagt 
5041 tgcctgggga aaggctactc aaaaagagat tgagaaacag aaggtgcacc tgaagagtat 
5101 cacagaggta ggagaggcct tgaaaacagt tttgggcaag aaggagacgt tggtggaaga 
5161 taaactcagt cttctgaata gtaactggat agctgtcacc tcccgagcag aagagtggtt 
5221 aaatcttttg ttggaatacc agaaacacat ggaaactttt gaccagaatg tggaccacat 
5281 cacaaagtgg atcattcagg "ctgacacact tttggatgaa tcagagaaaa agaaacccca 
5341 gcaaaaagaa gacgtgctta agcgtttaaa ggcagaactg aatgacatac gcccaaaggt 
5401 ggactctaca cgtgaccaag cagcaaactt gatggcaaac cgcggtgacc actgcaggaa 
54 61 attagtagag ccccaaatct cagagctcaa ccatcgattt gcagccattt cacacagaat 
5521 taagactgga aaggcctcca ttcctttgaa ggaattggag cagtttaact cagatataca 
5581 aaaattgctt gaaccactgg aggctgaaat tcagcagggg gtgaatctga aagaggaaga 
5641 cttcaataaa gatatgaatg aagacaatga gggtactgta aaagaattgt tgcaaagagg 
5701 agacaactta caacaaagaa tcacagatga gagaaagaga gaggaaataa agataaaaca 
5761 gcagctgtta cagacaaaac ataatgctct caaggatttg aggtctcaaa gaagaaaaaa 
5821 ggctctagaa atttctcatc agtggtatca gtacaagagg caggctgatg atctcctgaa 
5881 atgcttggat gacattgaaa aaaaattagc cagcctacct gagcccagag atgaaaggaa 
5941 aataaaggaa attgatcggg aattgcagaa gaagaaagag gagctgaatg cagtgcgtag 
6001 gcaagctgag ggcttgtctg aggatggggc cgcaatggca gtggagccaa ctcagatcca 
6061 gctcagcaag cgctggcggg aaattgagag caaatttgct cagtttcgaa gactcaactt 
6121 tgcacaaatt cacactgtcc gtgaagaaac gatgatggtg atgactgaag acatgccttt 
6181 ggaaatttct tatgtgcctt ctacttattt gactgaaatc actcatgtct cacaagccct 
6241 attagaagtg gaacaacttc tcaatgctcc tgacctctgt gctaaggact ttgaagatct 
6301 ctttaagcaa gaggagtctc tgaagaatat aaaagatagt ctacaacaaa gctcaggtcg 
6361 gattgacatt attcatagca agaagacagc agcattgcaa agtgcaacgc ctgtggaaag 
6421 ggtgaagcta caggaagctc tctcccagct tgatttccaa tgggaaaaag ttaacaaaat 
6481 gtacaaggac cgacaagggc gatttgacag atctgttgag aaatggcggc gttttcatta 
6541 tgatataaag atatttaatc agtggctaac agaagctgaa cagtttctca gaaagacaca 
6601 aattcctgag aattgggaac atgctaaata caaatggtat cttaaggaac tccaggatgg 
6661 cattgggcag cggcaaactg ttgtcagaac attgaatgca actggggaag aaataattca 
6721 gcaatcctca aaaacagatg ccagtattct acaggaaaaa ttgggaagcc tgaatctgcg 
6781 .gtggcaggag gtctgcaaac agctgtcaga cagaaaaaag aggctagaag aacaaaagaa 
684.1 tatcttgtca gaatttcaaa gagatttaaa tgaatttgtt ttatggttgg aggaagcaga 
6901 taacattgct agtatcccac ttgaacctgg aaaagagcag 'caactaaaag aaaagcttga 
6961 gcaagtcaag ttactggtgg aagagttgcc cctgcgccag ggaattctca aacaattaaa 
7021 cgaaactgga ggacccgtgc ttgtaagtgc tcccataagc ccagaagagc aagataaact 
7081 tgaaaataag ctcaagcaga caaatctcca gtggataaag gtttccagag ctttacctga 
7141 gaaacaagga gaaattgaag ctcaaataaa agaccttggg cagcttgaaa aaaagcttga 
7201 agaccttgaa gagcagttaa atcatctgct gctgtggtta tctcctatta ggaatcagtt 
7261 ggaaatttat aaccaaccaa accaagaagg accatttgac gttcaggaaa ctgaaatagc 
7321 agttcaagct aaacaaccgg atgtggaaga gattttgtct aaagggcagc atttgtacaa 
7381 ggaaaaacca gccactcagc cagtgaagag gaagttagaa gatccgagct ctgagtggaa 



FIGURE 23 (cont.) 

7441 ggcggtaaac cgtttacttc aagagctgag ggcaaagcag cctgacctaj ctcctggact 
•7501 gaccactatt ggagcctctc ctactcagac tgt"« « ttgatgttgg aggtacctgc 
7561 taaggaaact gccatctcca ""agaaat jccatcttcc tgcttgatca 
7621 tctggcagat ttcaaccggg cttggacaga a ="*"^° natatcaacq agatgateat 
7681 agttataaaa tcacagaggg tgatggtggg ^"gag ^tatcaacg ag 9 
7741 caagcagaag gcaacaatgc aggatttgga acagaggcgt ccccagttgg aa 9 
7801 tacSgctgcc caaaatttga aaaacaagac cagcaatcaa 9^agaa c 
7861 ggatcgaatt gaaagaattc agaatcagtg ^atgaagta «ag ctaaggaaga 
7 921 gaggcaacag ttgaatgaaa tgttaaagga ttcaacacaa- tggc gy y aggq ta C cta 
7981 Igltgagcag gtcttaggac a99Ccagagc "agcttgag tcatggaagg 999 
8041 tacagtagat gcaatccaaa agaaaatcac agaaaccaag "9"gg tc * atta 
8101 ccagtggcag acaaatgtag atgtggcaaa tgacttggcc ctgaaacttc tccggg 
8161 ttctgcagat gataccagaa aagtccacat ^aacagag a«at y atagat ? a ct 
8221 aagclttcat aaaagggtga fcjagcgaga ggetgctttg. ^ajaaactc £ gaaacaac 
8281 gcaacagttc cccctggacc tggaaaagtt tcttgcctgg y * t 

8341 tgccaatgtc ctacaggatg ctacccgtaa gga^ggctc ctagaagact ccaaggg y 
8401 aaaagagctg atgaaacaat ggcaagacct c« a 9^| aa |^| ggaa g gttccgatga 
8461 ttatcacaac ctggatgaaa acagccaaaa "tcctgaga "cccggaag g y * 
8521 tgcagtcctg ttacaaagac gtttggataa catgaacttc »»9W«9t9 « <. yy 
8581 aiagtctctc aacattaggt cccatttgga agccagttct 9accagtgga agcgtctgc 
8641 cctttctctg caggaacttc tggtgtggct a cagctgaaa,gatgatgaat taagccggca 
8701 ggcacctatt ggaggcgact ttccagcagt tcagaagcag aa cgatgtac atagggcctt 
8761 caagagggaa ttgaaaacta aagaacctgt "tcatgagt Mtrttgaga «gtacg 
8821 atttctgaca gagcagcctt tggaaggact agagaaactc ^"99^ ^agagag 
popi acctcctaaa qaqagagccc agaatgtcac tcggcttcta cgaaagcagg ^^t^Sit 
8941 clatactgag tggglalaat tgaacctgca ctccgctgac tggcagagaa "atagatga 
900 gaccc?tgt! aglctrccagg alcttcaaga ggccacggat £9ctggacc tcaagctgcg 
9061 Scaagctgag gtgatcaagg gatcctggca 9c«gtgggc fatctcctca "gactctct 
9121 ccaagatcac ctcgagaaag tcaaggcact tcgaggagaa attgcgcccc .^aaag y 
9181 cgtgagccac gtcaatgacc ttgctcgcca gcttaccact ttgggcattc ^"ctcacc 
9241 gtattlcctc Igcactctgg aagacctgaa caccagatgg aagcttctgc aggtggccgt 
9301 cgaggaccga gtcaggcagc tgcatgaagc ccacagggac tttggtccag "tctcagca 
9361 ctttStttcc acgtctgtcc agggtccctg ggagagagcc atctcgccaa ^aaagtgcc 
9471 ctactatatc aaccacgaga ctcaaacaac ttgctgggac catcccaaaa tgacagagcc 
94B1 cttccaqtct ttagctgacc tgaataatgt cagattctca gcttatagga ctgccatgaa 
9541 aotccglaga ctgwgiagg ccctttgctt ggatctcttg agcctgtcag ctg^tgtga 
Hoi tgccttggll cagca«acc tcaagclaaa tgaccagccc atggatatcc tgcagattat 
9661 tlattgtttg accactattt atgaccgcct gg a g«agag c a «acaatt tgtf*aacgt 
Q721 cectctctoc qtqgatatgt. gtctgaactg gctgctgaat gtttatgata cgggacgaac 
9781 aggglggatl cgtgtcctgt cttttaaaac tggcatcatt. tccctgtgta aagcacattt 
9841 agalglcaag tlclgatacc ttttcaagca agtggcaagt tcaacaggat '"gtgacca 
9901 alacaoacto ggcctccttc tgcatgattc tatccaaatt ccaagacagt • tgggtgaagt 
9961 tSS'Sttt gggggcagta acattgagcc aagtgtccgg agctgcttcc aatttgctaa 
10021 tlataagcca llgitcgaag cggccctctt cctagactgg atgagactgg "ccccagtc 
10081 MtgStgtgg atgcccgtcc tgcacagagt ggctgctgca gaaactgcca agcatcaggc 
10141 caaatgtalc atctgcaaag agtgtccaat cattggattc aggtacagga gtctaaagca 
iolol I"?««tt gaca?ctgcl alagctgctt "tttctggt cgigttgca. aaggccataa 
10261 aatgcactat 6ccatggtgg aatattgcac tccgactaca -tcaggagaag atgttcgaga 
10321 ctttgccaag gtactaaaaa acaaatttcg aaccaaaagg tattttgcga agcatccccg 
10381 aatgggctac ctgccagtgc agactgtctt agagggggac aacatggaaa c 

l°050l ?gatl C ctca? t^cft?g lltlTttll tagcaggcta gcagaaatgg aaaacagcaa 
10561 tgutcttat ctaaatgata gcatctctcc taatgagagc atagatgatg aacatttgtt 
16621 altccagcat tactgccaaa gtttgaacca ggactccccc. ctgagccagc. ctcgtagtcc 
10681 tgcccagatc ttgatttcct tagagagtga ggaaagaggg 9 a 9ctagaga 9 a atcctagc 
10741 aqatcttgag gaagaaaaca ggaatctgca agcagaatat gaccgtctaa agcagcagca 
10801 SgncatSu ggclcgtccc cactgccgtc ccctcctgaa atgatgccca ««toccca 
10861 gagtccccgg gatgctgagc tcattgctga ggccaagcta ctgcgtcaac a «aaggccg 
10921 cctqqaagcc aggatgcaaa tcctggaaga ccacaataaa cagctggagt cacagttaca 
1°0981 tlllltlln clgctgctgg agcal^ccca ggcagaggcc "agtgaatg gcacaacggt 
11041 gtcctctcct tctacctctc tacagaggtc cgacagcagt "gcctatgc tgctccgagt 
11101 ggttggcagt caaacttcgg actccatggg tgaggaagat e«ctcagtc =tccccagga 
11161 ScalgcaSa gggttagagg aggtgatgga gcaactcaac aactccttcc ctagttcaag 
11221 aggaagaaat Icccctggaa agccaatgag agaggacaca atgtaggaag tcttttccac 



FIGURE 23 (cont.) 

11281 atggcagatg atttgggcag agcgatggag 
11341 ggagcagaat aaatgtttta caactcctga 
11401 acaaagagga ttagacagta agagtttaca 
114 61 gtggtattat actgtagatt tcagtagttt 
11521 caggttttac acgtctatgc aattgtacaa 
11581 ttgatagcta aataacttgc catttcttta 
11641 ttataacagt tataaagaaa gattgtaaac 
11701 ataaaaaccc ctaaaaacaa aacaaacaca 
11"7 61 actttgaggc agcgcattgt tttgcatcct 
11821 ctttttct.tt ttttgcatat taaagataag 
11881 tacacactgc tcatttgaga actgtcagct 
11941 atatctatat gtctataagt atataaatac 
12001 tctatagact gactttttcc attttttaaa 
12061 tacttctagt cagtcatcca ggcttacctg 



tccttagtat cagtcatgac agatgaagaa 
ttcccgcatg gtttttataa tattcataca 
agaaataaat ctatattttt gtgaagggta 
ctaagtctgt tattgttttg ttaacaatgg 
aaaagttata agaaaactac atgtaaaatc 
tatggaacgc attttgggtt gtttaaaaat 
taaagtgtgc tttataaaaa aaagttgttt 
cacacacaca catacacaca cacacacaaa 
tttggcgtga tatccatatg aaattcatgg 
acttcctcta ccaccacacc aaatgactac 
gagtggggca ggcttgagtt ttcatttcat 
tatagttata tagataaaga gatacgaatt 
tgttcatgtc acatcctaat agaaagaaat 
cttggt 
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SEQ ID NO i 87 wild' type mouse enhancer) - ccacta 

2150 2200 

2300 

c^rroccnuoccTCACCcccAC^ 



SEQ 



ID NO:88( x 2R' mouse mutant enhancer)- ccacta 

^ 2150 "!° 

^ CCCCTCTAGGCTTCCCATm^A^ 

2300 

^ CTGCTtXCTCAGCCT^CCXCCACCC^ 

OS 

® SEQ id NO:89( 1 S5' mouse mutant enhancer)- ccacta 

■ 2300 
2 cnt^CCTCAGCCTCUO^AC^ 

Ql 



GTGOAT 

SEQ ID NOi90t % 2RS5' mouse mutant enhancer) - ccacta 

21S0 3100 

COQOTCTAOCXrrtWXCATtnA^ 

• »so ' T 

CTOCTOCCTOAGCCTtttOC^^ 
GTC3GAT 

SEQ ID NOt9i {* truncated 2RS5' mouse mutant enhancer) - ccacta 



2150 "°° 



OOOGTCTAOOCTOCCCA 

2250 

CTOCTtXCTtUCCCPtlJWloaJTTJLC^^ 
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SEQ ID NO 92 (mouse promoter sequence, -944 to +7) 

GTGaAQCXGCCTGCACTGGGCITC^ 

CCCACCCOOtn-CTCClXnTATCCTTCCTOOaCCrOCKW 

AOTCKnTKCTGCCATCCATCOrrCTQWWCCTCCX^ 

GCCA^CTGGGTGCCATAOTTAXlACX!CTCrrCTCAAAAAACCAAAAAO 

ATTGTTACATOTCTt3AC?CKXAGCCTGGACTACATA0GOTrrC^ 

CWKTGCTAATCCTCACCAaaCAAT<UVAOCCTGOT^ 

A«XTCrrAACAOOCIT<X30GQCC^^ 

ACJUrrCAGCXCTTAOTTTAOOA^CCAOTOJUW^ 

<K3UXXnXJCCCGGGCAACGAJGCTGAA^ 

AACXX»GGC(^CAOGQaCT<K:CCCOGOaTCAC 

SEQ ID NO: 93 ! mouse promoter sequence, -358 to +7) 

AATCAAJGGClX^TCXiGGGACTGAGGGCfcGGCT 
OJU^O00CCAOCTCrr0CCCaKCACCrACACTC*G<>OT 

GTCACMXCTQTAOaCICCTCTATATAACCCAOOGGCfcCAOOOGCT^^ 

SEQ ID NO:-94 (mouse promoter sequence , -80 to +7) 

CCTCCCTGOOQACAOCCCCTCCT>CX^ H» I *TfcT*AOXJUa[MCCXCAOQOOCTOCXX»»lOTCAC 
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Cr/stsl ;=:£2S Th;3t, F.: :•: 
1CI1 South Ci a::-: 

T«!s3ncra: (702; 20 £-2 73 
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! hsrsby recusst access : - : r.: : =; 



• RECEIVED 

MAY 0 6 2004 ' 




..„.._....,_.._, J FiteFtftfca-fffetitaijfeJnati, ,, 

ap=!i=aScn, a*ich :~ \±=r£f\ei. !="., if-.z !;!\<z:. = ;«7:s::i B-sair.*-;, ;r. ins rciisvw.-.s ccrj.^sr.: -.ss sr.:v.-. :r= 
attschrr .er.i'i: v 



'j-^i.-; £f=*=5 c- 1 --^ dr-\ : ,-zrr~ .i!.-cr-- -V:-. 




Unitec Sict53 ralsnt N : u.-Ccr 
WIFO Pub. No-. 
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, lir.s. 



Rented information scout Access to Pending Applications (27 CFr. 1 .14?: 
Dirscl ECC35S to c-fidir.^ aeelicsifcRs is n*i available^ !he public ccci« -ay be available sr.2 ma-, c* 
purchased frcn the OficsVpubite Rzcsri* upon psyrn=nt of the spprcpnaSs fee (37 SF= 1.1 5(b)}, *i fwiews: 
For publish* 4 soc-iicsfcns that ara sti! ogig nc. a mencs- of the public msy obtain = ccs-y c-f: 
tha fHa contents; 

ins pending application as orianaHy Sec; or 
any document In the file of {Tie psndins-sppiicsfic-n. 
Ffar unoi&Hsiie c scdicsSons that ave sSil candinc: / 

(1) tf wi^ h^sfitcf tha adding ssr lfcsgcr: ta claimed under U^. i2i. a-^tc :r,-ar.G t r.s-: ^ 
application that has: (a> issued ss a U.S. patent cr (b) published as a ata&wry in van ticn rsg^sa;-. a .j 
patent appiics&n pubiie&Son. or an international patent applicsison puhiicsSsn i n accordance with ?C t 
Arsde 21 (2) , £ rrisn-bsr of ine p u biic may obtain a c ocy of: 

the fils contents; 

the pending application aa. originally -fed; or 

snv document in ti"e file of the Denalng spplicsron. 

(2) It 9» aoQlicstlcn Is l^.co-oreted bv rsfitrsr.ca cr others = icaiSflsd in a U.S. psfe-*, £ ssiuiffiy f" verier: 
rejistrsticn, £ U.S. pstsnt appiicsticn publicslion, cr =n Intamsscna! pstsi^t sp : j!tc3t;c-r, pudic=t!on.in 
accordancs win FCT Aj^de 21 (2), £ msrnbsr c-f public .T^y cfcfeir, a ccpy of: 

the oancing =ppfIc£t!or; S3 ongir.a^y filed. 






